(57) Abstract

The present invention concerns fusion of Fc domains with biologically active peptides and a process for preparing pharmaceutical
agents using biologically active peptides. In this invention, pharmacologically active compounds are prepared by a process comprising: a)
selecting at least one peptide that modulates the activity of a protein of interest; and b) preparing a pharmacologic agent comprising an Fc
domain covalently linked to at least one amino acid of the selected peptide. Linkage to the vehicle increases the half-life of the peptide,
which otherwise would be quickly degraded in vivo. The preferred vehicle is an Fc domain. The peptide is preferably selected by phage
display, E. coli display, ribosome display, RNA-peptide screening, or chemical-peptide screening.
Modified Peptides as Therapeutic Agents
Background of the Invention 

Recombinant proteins are an emerging class of therapeutic agents.
Such recombinant therapeutics have engendered advances in protein
formulation and chemical modification. Such modifications can protect
therapeutic proteins, primarily by blocking their exposure to proteolytic
enzymes. Protein modifications may also increase the therapeutic
protein's stability, circulation time, and biological activity. A review
article describing protein modification and fusion proteins is Francis
(1992), Focus on Growth Factors 3: 4-10 (Mediscript, London), which is
hereby incorporated by reference.

One useful modification is combination with the "Fc" domain of an
antibody. Antibodies comprise two functionally independent parts, a
variable domain known as "Fab", which binds antigen, and a constant
domain known as "Fc", which links to such effector functions as
complement activation and attack by phagocytic cells. An Fc has a long
serum half-life, whereas an Fab is short-lived. Capon et al. (1989), Nature
337: 525-31. When constructed together with a therapeutic protein, an Fc
domain can provide longer half-life or incorporate such functions as Fc
receptor binding, protein A binding, complement fixation and perhaps
even placental transfer. Id. Table 1 summarizes use of Fc fusions known in
the art.
Table 1: Fc fusion with therapeutic proteins

Form of Fc | Fusion partner | Therapeutic implications | Reference
IgG1 | N-terminus of CD30-L | Hodgkin's disease; anaplastic lymphoma; T- | U.S. Patent No. 5,480,981
Murine Fcγ2a | IL-10 | anti-inflammatory; transplant rejection | Zheng et al. (1995), J. Immunol. 154: 5590-600
IgG1 | TNF receptor | septic shock | Fisher et al. (1996), N. Engl. J. Med. 334: 1697-1702; Van Zee, K. et al. (1996), J. Immunol. 156: 2221-30
IgG, IgA, IgM, or IgE (excluding the first domain) | TNF receptor | inflammation, autoimmune disorders | U.S. Pat. No. 5,808,029, issued September 15, 1998
IgG1 | CD4 receptor | AIDS | Capon et al. (1989), Nature 337: 525-31
IgG1, IgG3 | N-terminus of IL-2 | anti-cancer, antiviral | Harvill et al. (1995), Immunotech. 1: 95-105
IgG1 | C-terminus of OPG | osteoarthritis; bone density | WO 97/23614, published July 3, 1997
IgG1 | N-terminus of leptin | anti-obesity | PCT/US 97/23183, filed December 11, 1997
Human Ig Cγ1 | CTLA-4 | autoimmune disorders | Linsley (1991), J. Exp. Med. 174: 561-9

A much different approach to development of therapeutic agents is
peptide library screening. The interaction of a protein ligand with its
receptor often takes place at a relatively large interface. However, as
demonstrated for human growth hormone and its receptor, only a few key
residues at the interface contribute to most of the binding energy.
Clackson et al. (1995), Science 267: 383-6. The bulk of the protein ligand
merely displays the binding epitopes in the right topology or serves
functions unrelated to binding. Thus, molecules of only "peptide" length
(2 to 40 amino acids) can bind to the receptor protein of a given large
protein ligand. Such peptides may mimic the bioactivity of the large
protein ligand ("peptide agonists") or, through competitive binding,
inhibit the bioactivity of the large protein ligand ("peptide antagonists").
Phage display peptide libraries have emerged as a powerful
method in identifying such peptide agonists and antagonists. See, for
example, Scott et al. (1990), Science 249: 386; Devlin et al. (1990), Science
249: 404; U.S. Pat. No. 5,223,409, issued June 29, 1993; U.S. Pat. No.
5,733,731, issued March 31, 1998; U.S. Pat. No. 5,498,530, issued March 12,
1996; U.S. Pat. No. 5,432,018, issued July 11, 1995; U.S. Pat. No. 5,338,665,
issued August 16, 1994; U.S. Pat. No. 5,922,545, issued July 13, 1999; WO
96/40987, published December 19, 1996; and WO 98/15833, published
April 16, 1998 (each of which is incorporated by reference). In such
libraries, random peptide sequences are displayed by fusion with coat
proteins of filamentous phage. Typically, the displayed peptides are
affinity-eluted against an antibody-immobilized extracellular domain of a
receptor. The retained phages may be enriched by successive rounds of
affinity purification and repropagation. The best binding peptides may be
sequenced to identify key residues within one or more structurally related
families of peptides. See, e.g., Cwirla et al. (1997), Science 276: 1696-9, in
which two distinct families were identified. The peptide sequences may
also suggest which residues may be safely replaced by alanine scanning or
by mutagenesis at the DNA level. Mutagenesis libraries may be created
and screened to further optimize the sequence of the best binders.
Lowman (1997), Ann. Rev. Biophys. Biomol. Struct. 26: 401-24.
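
By way of illustration only, the alanine-scanning idea mentioned above can be sketched in a few lines of Python. The function name alanine_scan and the six-residue example sequence are hypothetical and are not taken from the cited references. The sketch merely enumerates the single-substitution variants that would then have to be prepared and assayed for binding.

```python
from typing import List, Tuple


def alanine_scan(peptide: str) -> List[Tuple[int, str, str]]:
    """Enumerate single alanine substitutions of a peptide.

    Returns (position, original_residue, variant_sequence) tuples with
    1-based positions. Residues that are already alanine are skipped,
    because substituting them would leave the sequence unchanged.
    """
    variants = []
    for index, residue in enumerate(peptide):
        if residue == "A":
            continue
        variant = peptide[:index] + "A" + peptide[index + 1:]
        variants.append((index + 1, residue, variant))
    return variants


if __name__ == "__main__":
    # Hypothetical peptide used only to demonstrate the enumeration.
    for position, residue, variant in alanine_scan("GPTLRA"):
        print(f"{residue}{position}A -> {variant}")
```

A residue whose replacement by alanine leaves binding essentially intact would be a candidate for the "safely replaced" positions referred to above. That judgment depends on assay data, which this sketch does not model.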

Structural analysis of protein-protein interaction may also be used
to suggest peptides that mimic the binding activity of large protein
ligands. In such an analysis, the crystal structure may suggest the identity
and relative orientation of critical residues of the large protein ligand,
from which a peptide may be designed. See, e.g., Takasaki et al. (1997),
Nature Biotech. 15: 1266-70. These analytical methods may also be used to
investigate the interaction between a receptor protein and peptides
selected by phage display, which may suggest further modification of the
peptides to increase binding affinity.

Other methods compete with phage display in peptide research. A
peptide library can be fused to the carboxyl terminus of the lac repressor
and expressed in E. coli. Another E. coli-based method allows display on
the cell's outer membrane by fusion with a peptidoglycan-associated
lipoprotein (PAL). Hereinafter, these and related methods are collectively
referred to as "E. coli display." In another method, translation of random
RNA is halted prior to ribosome release, resulting in a library of
polypeptides with their associated RNA still attached. Hereinafter, this
and related methods are collectively referred to as "ribosome display."
Other methods employ chemical linkage of peptides to RNA; see, for
example, Roberts & Szostak (1997), Proc. Natl. Acad. Sci. USA, 94: 12297-
303. Hereinafter, this and related methods are collectively referred to as
"RNA-peptide screening." Chemically derived peptide libraries have been
developed in which peptides are immobilized on stable, non-biological
materials, such as polyethylene rods or solvent-permeable resins. Another
chemically derived peptide library uses photolithography to scan peptides
immobilized on glass slides. Hereinafter, these and related methods are
collectively referred to as "chemical-peptide screening." Chemical-peptide
screening may be advantageous in that it allows use of D-amino acids and
other unnatural analogues, as well as non-peptide elements. Both
biological and chemical methods are reviewed in Wells & Lowman (1992),
Curr. Opin. Biotechnol. 3: 355-62.

Conceptually, one may discover peptide mimetics of any protein
using phage display and the other methods mentioned above. These
methods have been used for epitope mapping, for identification of critical
amino acids in protein-protein interactions, and as leads for the discovery
of new therapeutic agents. E.g., Cortese et al. (1996), Curr. Opin. Biotech. 7:
616-21. Peptide libraries are now being used most often in immunological
studies, such as epitope mapping. Kreeger (1996), The Scientist 10(13): 19-
20.

Of particular interest here is use of peptide libraries and other
techniques in the discovery of pharmacologically active peptides. A
number of such peptides identified in the art are summarized in Table 2.
The peptides are described in the listed publications, each of which is
hereby incorporated by reference. The pharmacologic activity of the
peptides is described, and in many instances is followed by a shorthand
term therefor in parentheses. Some of these peptides have been modified
(e.g., to form C-terminally cross-linked dimers). Typically, peptide
libraries were screened for binding to a receptor for a pharmacologically
active protein (e.g., EPO receptor). In at least one instance (CTLA4), the
peptide library was screened for binding to a monoclonal antibody.
Table 2: Pharmacologically active peptides

Form of peptide | Binding partner/protein of interest* | Pharmacologic activity | Reference
intrapeptide disulfide-bonded | EPO receptor | EPO-mimetic | Wrighton et al. (1996), Science 273: 458-63; U.S. Pat. No. 5,773,569, issued June 30, 1998 to Wrighton et al.
C-terminally cross-linked dimer | EPO receptor | EPO-mimetic | Livnah et al. (1996), Science 273: 464-71; Wrighton et al. (1997), Nature Biotechnology 15: 1261-5; International patent application WO 96/40772, published Dec. 19, 1996
linear | EPO receptor | EPO-mimetic | Naranda et al. (1999), Proc. Natl. Acad. Sci. USA, 96: 7569-74
linear | c-Mpl | TPO-mimetic | Cwirla et al. (1997), Science 276: 1696-9; U.S. Pat. No. 5,869,451, issued Feb. 9, 1999; U.S. Pat. No. 5,932,946, issued Aug. 3, 1999
C-terminally cross-linked dimer | c-Mpl | TPO-mimetic | Cwirla et al. (1997), Science 276: 1696-9
disulfide-linked dimer | | stimulation of hematopoiesis ("G-CSF-mimetic") | Paukovits et al. (1984), Hoppe-Seylers Z. Physiol. Chem. 365: 303-11; Laerum et al. (1988), Exp. Hemat. 16: 274-80
alkylene-linked dimer | | G-CSF-mimetic | Bhatnagar et al. (1996), J. Med. Chem. 39: 3814-9; Cuthbertson et al. (1997), J. Med. Chem. 40: 2876-82; King et al. (1991), Exp. Hematol. 19: 481; King et al. (1995), Blood 86 (Suppl. 1): 309a
linear | IL-1 receptor | inflammatory and autoimmune diseases ("IL-1 antagonist" or "IL-1ra-mimetic") | U.S. Pat. No. 5,608,035; U.S. Pat. No. 5,786,331; U.S. Pat. No. 5,880,096; Yanofsky et al. (1996), Proc. Natl. Acad. Sci. 93: 7381-6; Akeson et al. (1996), J. Biol. Chem. 271: 30517-23; Wieczorek et al. (1997), Pol. J. Pharmacol. 49: 107-17; Yanofsky (1996), PNAS 93: 7381-7386
linear | Facteur serique (FTS) | stimulation of lymphocytes ("FTS-mimetic") | Inagaki-Ohara et al. (1996), Cellular Immunol. 171: 30-40; Yoshida (1984), Int. J. Immunopharmacol. 6: 141-6
intrapeptide disulfide bonded | | CTLA4-mimetic | Fukumoto et al. (1998), Nature Biotech. 16: 267-70
 | TNF-α receptor | TNF-α antagonist | Takasaki et al. (1997), Nature Biotech. 15: 1266-70; WO 98/53842, published December 3, 1998
linear | TNF-α receptor | TNF-α antagonist | Chirinos-Rojas ( ), J. Imm., 5621-5626
intrapeptide disulfide bonded | C3b | inhibition of complement activation; autoimmune diseases ("C3b-antagonist") | Sahu et al. (1996), J. Immunol. 157: 884-91; Morikis et al. (1998), Protein Sci. 7: 619-27
linear | vinculin | cell adhesion processes: cell growth, differentiation, wound healing, tumor metastasis ("vinculin binding") | Adey et al. (1997), Biochem. J. 324: 523-8
linear | C4 binding protein (C4BP) | antithrombotic | Linse et al. (1997), J. Biol. Chem. 272: 14658-65
linear | urokinase receptor | processes associated with urokinase interaction with its receptor (e.g., angiogenesis, tumor cell invasion and metastasis); ("UKR antagonist") | Goodson et al. (1994), Proc. Natl. Acad. Sci. 91: 7129-33; International application WO 97/35969, published October 2, 1997
linear | Mdm2, Hdm2 | inhibition of inactivation of p53 mediated by Mdm2 or hdm2; anti-tumor ("Mdm/hdm antagonist") | Picksley et al. (1994), Oncogene 9: 2523-9; Bottger et al. (1997), J. Mol. Biol. 269: 744-56; Bottger et al. (1996), Oncogene 13: 2141-7
linear | | anti-tumor by mimicking the activity of p21 WAF1 | Ball et al. (1997), Curr. Biol. 7: 71-80
linear | farnesyl transferase | anti-cancer by preventing activation of ras oncogene | Gibbs et al. (1994), Cell 77: 175-178
linear | Ras effector domain | anti-cancer by inhibiting biological function of the ras oncogene | Moodie et al. (1994), Trends Genet. 10: 44-48; Rodriguez et al. (1994), Nature 370: 527-532
linear | SH2/SH3 domains | anti-cancer by inhibiting tumor growth with activated tyrosine kinases | Pawson et al. (1993), Curr. Biol. 3: 434-432; Yu et al. (1994), Cell 76: 933-945
linear | p16 | anti-cancer by mimicking activity of p16; e.g., inhibiting cyclin D-Cdk complex ("p16-mimetic") | Fåhraeus et al. (1996), Curr. Biol. 6: 84-91
linear | Src, Lyn | inhibition of Mast cell activation, IgE-related conditions, type I hypersensitivity ("Mast cell antagonist") | Stauffer et al. (1997), Biochem. 36: 9388-94
linear | Mast cell protease | treatment of inflammatory disorders mediated by release of tryptase-6 ("Mast cell protease inhibitors") | International application WO 98/33812, published August 6, 1998
linear | SH3 domains | treatment of SH3-mediated disease states ("SH3 antagonist") | Rickles et al. (1994), EMBO J. 13: 5598-5604; Sparks et al. (1994), J. Biol. Chem. 269: 23853-6; Sparks et al. (1996), Proc. Natl. Acad. Sci. 93: 1540-4
linear | HBV core antigen (HBcAg) | treatment of HBV viral infections ("anti-HBV") | Dyson & Murray (1995), Proc. Natl. Acad. Sci. 92: 2194-8
linear | selectins | neutrophil adhesion; inflammatory diseases ("selectin antagonist") | Martens et al. (1995), J. Biol. Chem. 270: 21129-36; European patent application EP 0 714 912, published June 5, 1996
linear, cyclized | calmodulin | calmodulin antagonist | Pierce et al. (1995), Molec. Diversity 1: 259-65; Dedman et al. (1993), J. Biol. Chem. 268: 23025-30; Adey & Kay (1996), Gene 169: 133-4
linear, cyclized | integrins | tumor-homing; treatment for conditions related to integrin-mediated cellular events, including platelet aggregation, thrombosis, wound healing, osteoporosis, tissue repair, angiogenesis (e.g., for treatment of cancer), and tumor invasion ("integrin-binding") | International applications WO 95/14714, published June 1, 1995; WO 97/08203, published March 6, 1997; WO 98/10795, published March 19, 1998; WO 99/24462, published May 20, 1999; Kraft et al. (1999), J. Biol. Chem. 274: 1979-1985
cyclic, linear | fibronectin and extracellular matrix components of T cells and macrophages | treatment of inflammatory and autoimmune conditions | WO 98/09985, published March 12, 1998
linear | somatostatin and cortistatin | treatment or prevention of hormone-producing tumors, acromegaly, giantism, dementia, gastric ulcer, tumor growth, inhibition of hormone secretion, modulation of sleep or neural activity | European patent application 0 911 393, published April 28, 1999
linear | bacterial lipopolysaccharide | antibiotic; septic shock; disorders modulatable by CAP37 | U.S. Pat. No. 5,877,151, issued March 2, 1999
linear or cyclic, including D- | pardaxin, mellitin | antipathogenic | WO 97/31019, published 28 August 1997
linear, cyclic | VIP | impotence, neurodegenerative disorders | WO 97/40070, published October 30, 1997
linear | CTLs | cancer | EP 0 770 624, published May 2, 1997
linear | THF-gamma2 | | Burnstein (1988), Biochem., 27: 4066-71
linear | Amylin | | Cooper (1987), Proc. Natl. Acad. Sci. 84: 8628-32
linear | Adrenomedullin | | Kitamura (1993), BBRC, 192: 553-60
cyclic, linear | VEGF | anti-angiogenic; cancer, rheumatoid arthritis, diabetic retinopathy, psoriasis ("VEGF antagonist") | Fairbrother (1998), Biochem., 37: 17754-17764
cyclic | MMP | inflammation and autoimmune disorders; tumor growth ("MMP inhibitor") | Koivunen (1999), Nature Biotech., 17: 768-774
 | HGH fragment | | U.S. Pat. No. 5,869,452
 | Echistatin | inhibition of platelet aggregation | Gan (1988), J. Biol. Chem., 283: 19827-32
linear | SLE autoantibody | SLE | WO 96/30057, published October 3, 1996
 | GD1 alpha | suppression of tumor metastasis | Ishikawa et al. (1998), FEBS Lett. 441 (1): 20-4
 | antiphospholipid beta-2 glycoprotein-I (β2GPI) antibodies | endothelial cell activation | Blank et al. (1999), Proc. Natl. Acad. Sci. USA 96: 5164-8

* [The references] clarify whether the molecule is bound by or mimicked by the peptides.
b FTS is a thymic hormone mimicked by the molecule of this invention rather than a receptor bound by the molecule of this invention.


Peptides identified by peptide library screening have been regarded
as "leads" in development of therapeutic agents rather than as therapeutic
agents themselves. Like other proteins and peptides, they would be
rapidly removed in vivo either by renal filtration, cellular clearance
mechanisms in the reticuloendothelial system, or proteolytic degradation.
Francis (1992), Focus on Growth Factors 3: 4-11. As a result, the art
presently uses the identified peptides to validate drug targets or as
scaffolds for design of organic compounds that might not have been as
easily or as quickly identified through chemical library screening.
Lowman (1997), Ann. Rev. Biophys. Biomol. Struct. 26: 401-24; Kay et al.
(1998), Drug Disc. Today 3: 370-8. The art would benefit from a process by
which such peptides could more readily yield therapeutic agents.

Summary of the Invention 
The present invention concerns a process by which the in vivo half-life
of one or more biologically active peptides is increased by fusion with
a vehicle. In this invention, pharmacologically active compounds are
prepared by a process comprising:

a) selecting at least one peptide that modulates the activity of a
protein of interest; and

b) preparing a pharmacologic agent comprising at least one
vehicle covalently linked to at least one amino acid sequence
of the selected peptide.

The preferred vehicle is an Fc domain. The peptides screened in step (a)
are preferably expressed in a phage display library. The vehicle and the
peptide may be linked through the N- or C-terminus of the peptide or the
vehicle, as described further below. Derivatives of the above compounds
(described below) are also encompassed by this invention.

The compounds of this invention may be prepared by standard
synthetic methods, recombinant DNA techniques, or any other methods of
preparing peptides and fusion proteins. Compounds of this invention that
encompass non-peptide portions may be synthesized by standard organic
chemistry reactions, in addition to standard peptide chemistry reactions
when applicable.

The primary use contemplated is as therapeutic or prophylactic
agents. The vehicle-linked peptide may have activity comparable to, or
even greater than, the natural ligand mimicked by the peptide. In
addition, certain natural ligand-based therapeutic agents might induce
antibodies against the patient's own endogenous ligand; the vehicle-linked
peptide avoids this pitfall by having little or typically no sequence identity
with the natural ligand.

Although mostly contemplated as therapeutic agents, compounds
of this invention may also be useful in screening for such agents. For
example, one could use an Fc-peptide (e.g., Fc-SH2 domain peptide) in an
assay employing anti-Fc coated plates. The vehicle, especially Fc, may
make insoluble peptides soluble and thus useful in a number of assays.

The compounds of this invention may be used for therapeutic or
prophylactic purposes by formulating them with appropriate
pharmaceutical carrier materials and administering an effective amount to
a patient, such as a human (or other mammal) in need thereof. Other
related aspects are also included in the instant invention.

Numerous additional aspects and advantages of the present 
invention will become apparent upon consideration of the figures and 
detailed description of the invention. 
Brief Description of the Figures
Figure 1 shows a schematic representation of an exemplary process
of the invention. In this preferred process, the vehicle is an Fc domain,
which is linked to the peptide covalently by expression from a DNA
construct encoding both the Fc domain and the peptide. As noted in
Figure 1, the Fc domains spontaneously form a dimer in this process.

Figure 2 shows exemplary Fc dimers that may be derived from an
IgG1 antibody. "Fc" in the figure represents any of the Fc variants within
the meaning of "Fc domain" herein. "X1" and "X2" represent peptides or
linker-peptide combinations as defined hereinafter. The specific dimers are
as follows:

A, D: Single disulfide-bonded dimers. IgG1 antibodies typically
have two disulfide bonds at the hinge region between the constant and
variable domains. The Fc domain in Figures 2A and 2D may be formed by
truncation between the two disulfide bond sites or by substitution of a
cysteinyl residue with an unreactive residue (e.g., alanyl). In Figure 2A,
the Fc domain is linked at the amino terminus of the peptides; in 2D, at the
carboxyl terminus.

B, E: Doubly disulfide-bonded dimers. This Fc domain may be
formed by truncation of the parent antibody to retain both cysteinyl
residues in the Fc domain chains or by expression from a construct
including a sequence encoding such an Fc domain. In Figure 2B, the Fc
domain is linked at the amino terminus of the peptides; in 2E, at the
carboxyl terminus.

C, F: Noncovalent dimers. This Fc domain may be formed by
elimination of the cysteinyl residues by either truncation or substitution.
One may desire to eliminate the cysteinyl residues to avoid impurities
formed by reaction of the cysteinyl residue with cysteinyl residues of other
proteins present in the host cell. The noncovalent bonding of the Fc
domains is sufficient to hold together the dimer.

Other dimers may be formed by using Fc domains derived from different 
types of antibodies (e.g., IgG2, IgM). 
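
For readers who prefer a tabular view, the six panels of Figure 2 can be summarized in a small data structure. The following Python sketch is illustrative only: the class name FcDimer and its field names are assumptions introduced here, and the attachment point for panels C and F is inferred by analogy with the A/D and B/E pairs, since the text above does not state it explicitly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FcDimer:
    panel: str                  # Figure 2 panel letter
    interchain_disulfides: int  # disulfide bonds holding the two Fc chains together
    fc_linked_at: str           # peptide terminus to which the Fc domain is linked


# Panels A, B, D and E follow the text directly. The terminus for C and F
# is an assumption made by analogy with the other two pairs.
FIGURE_2_DIMERS = [
    FcDimer("A", 1, "amino"),
    FcDimer("B", 2, "amino"),
    FcDimer("C", 0, "amino"),
    FcDimer("D", 1, "carboxyl"),
    FcDimer("E", 2, "carboxyl"),
    FcDimer("F", 0, "carboxyl"),
]


def describe(dimer: FcDimer) -> str:
    """Return a one-line description of a Figure 2 panel."""
    if dimer.interchain_disulfides == 0:
        bonding = "noncovalent"
    elif dimer.interchain_disulfides == 1:
        bonding = "singly disulfide-bonded"
    else:
        bonding = "doubly disulfide-bonded"
    return (f"Figure 2{dimer.panel}: {bonding} dimer, Fc linked at the "
            f"{dimer.fc_linked_at} terminus of the peptide")


if __name__ == "__main__":
    for dimer in FIGURE_2_DIMERS:
        print(describe(dimer))
```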
Figure 3 shows the structure of preferred compounds of the
invention that feature tandem repeats of the pharmacologically active
peptide. Figure 3A shows a single chain molecule and may also represent
the DNA construct for the molecule. Figure 3B shows a dimer in which the
linker-peptide portion is present on only one chain of the dimer. Figure 3C
shows a dimer having the peptide portion on both chains. The dimer of
Figure 3C will form spontaneously in certain host cells upon expression of
a DNA construct encoding the single chain shown in Figure 3A. In other
host cells, the cells could be placed in conditions favoring formation of
dimers or the dimers can be formed in vitro.

Figure 4 shows exemplary nucleic acid and amino acid sequences
(SEQ ID NOS: 1 and 2, respectively) of human IgG1 Fc that may be used in
this invention.

Figure 5 shows a synthetic scheme for the preparation of PEGylated
peptide 19 (SEQ ID NO: 3).

Figure 6 shows a synthetic scheme for the preparation of PEGylated
peptide 20 (SEQ ID NO: 4).

Figure 7 shows the nucleotide and amino acid sequences (SEQ ID
NOS: 5 and 6, respectively) of the molecule identified as "Fc-TMP" in
Example 2 hereinafter.

Figure 8 shows the nucleotide and amino acid sequences (SEQ ID
NOS: 7 and 8, respectively) of the molecule identified as "Fc-TMP-TMP" in
Example 2 hereinafter.
Figure 9 shows the nucleotide and amino acid sequences (SEQ ID
NOS: 9 and 10, respectively) of the molecule identified as "TMP-TMP-Fc"
in Example 2 hereinafter.

Figure 10 shows the nucleotide and amino acid sequences (SEQ ID
NOS: 11 and 12, respectively) of the molecule identified as "TMP-Fc" in
Example 2 hereinafter.

Figure 11 shows the number of platelets generated in vivo in
normal female BDF1 mice treated with one 100 μg/kg bolus injection of
various compounds, with the terms defined as follows.

PEG-MGDF: 20 kD average molecular weight PEG attached by
reductive amination to the N-terminal amino group of amino
acids 1-163 of native human TPO, which is expressed in E. coli
(so that it is not glycosylated);

TMP: the TPO-mimetic peptide having the amino acid sequence
IEGPTLRQWLAARA (SEQ ID NO: 13);

TMP-TMP: the TPO-mimetic peptide having the amino acid
sequence IEGPTLRQWLAARA-GGGGGGGG-IEGPTLRQWLAARA
(SEQ ID NO: 14);

PEG-TMP-TMP: the peptide of SEQ ID NO: 14, wherein the PEG
group is a 5 kD average molecular weight PEG attached as
shown in Figure 6;

Fc-TMP-TMP: the compound of SEQ ID NO: 8 (Figure 8) dimerized
with an identical second monomer (i.e., Cys residues 7 and 10
are bound to the corresponding Cys residues in the second
monomer to form a dimer, as shown in Figure 2); and

TMP-TMP-Fc: the compound of SEQ ID NO: 10 (Figure 9)
dimerized in the same way as Fc-TMP-TMP except that the Fc
domain is attached at the C-terminal end rather than the N-
terminal end of the TMP-TMP peptide.
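
The relationship between the TMP and TMP-TMP sequences defined above can be checked with a short Python sketch. The helper name tandem_repeat is hypothetical; the peptide and the eight-glycine spacer are copied from the definitions of SEQ ID NOS: 13 and 14 above. The sketch covers only the peptide portion and does not model the Fc domain or any PEG group.

```python
TMP = "IEGPTLRQWLAARA"   # SEQ ID NO: 13 as given in the text
LINKER = "G" * 8         # the eight-glycine spacer shown within SEQ ID NO: 14


def tandem_repeat(peptide: str, linker: str, copies: int = 2) -> str:
    """Join `copies` of `peptide` head-to-tail, separated by `linker`."""
    if copies < 1:
        raise ValueError("copies must be at least 1")
    return linker.join([peptide] * copies)


TMP_TMP = tandem_repeat(TMP, LINKER)

if __name__ == "__main__":
    print(TMP_TMP)
    print(f"TMP: {len(TMP)} residues; TMP-TMP: {len(TMP_TMP)} residues")
```

With 14 residues per peptide copy and 8 glycines in the spacer, the tandem peptide has 14 + 8 + 14 = 36 residues.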
Figure 12 shows the number of platelets generated in vivo in
normal BDF1 mice treated with various compounds delivered via 
implanted osmotic pumps over a 7-day period. The compounds are as 
defined for Figure 11.
Figure 13 shows the nucleotide and amino acid sequences (SEQ ID
NOS: 15 and 16, respectively) of the molecule identified as "Fc-EMP" in
Example 3 hereinafter.

Figure 14 shows the nucleotide and amino acid sequences (SEQ ID
NOS: 17 and 18, respectively) of the molecule identified as "EMP-Fc" in
Example 3 hereinafter.

Figure 15 shows the nucleotide and amino acid sequences (SEQ ID
NOS: 19 and 20, respectively) of the molecule identified as "EMP-EMP-Fc"
in Example 3 hereinafter.

Figure 16 shows the nucleotide and amino acid sequences (SEQ ID
NOS: 21 and 22, respectively) of the molecule identified as "Fc-EMP-EMP"
in Example 3 hereinafter.

Figures 17A and 17B show the DNA sequence (SEQ ID NO: 23)
inserted into pCFM1656 between the unique AatII (position #4364 in
pCFM1656) and SacII (position #4585 in pCFM1656) restriction sites to
form expression plasmid pAMG21 (ATCC accession no. 98113).

Figure 18A shows the hemoglobin, red blood cells, and hematocrit
generated in vivo in normal female BDF1 mice treated with one 100 μg/kg
bolus injection of various compounds. Figure 18B shows the same results
with mice treated by 7-day micro-osmotic pump, with the EMPs delivered
at 100 μg/kg per day and rhEPO at 30 U/mouse. (In both experiments,
neutrophils, lymphocytes, and platelets were unaffected.) In these figures,
the terms are defined as follows.

Fc-EMP: the compound of SEQ ID NO: 16 (Figure 13) dimerized
with an identical second monomer (i.e., Cys residues 7 and 10 are
bound to the corresponding Cys residues in the second monomer to
form a dimer, as shown in Figure 2);

EMP-Fc: the compound of SEQ ID NO: 18 (Figure 14) dimerized in
the same way as Fc-EMP except that the Fc domain is attached at
the C-terminal end rather than the N-terminal end of the EMP
peptide.

"EMP-EMP-Fc" refers to a tandem repeat of the same peptide (SEQ
ID NO: 20) attached to the same Fc domain by the carboxyl
terminus of the peptides. "Fc-EMP-EMP" refers to the same tandem
repeat of the peptide but with the same Fc domain attached at the
amino terminus of the tandem repeat. All molecules are expressed
in E. coli and so are not glycosylated.

Figures 19A and 19B show the nucleotide and amino acid sequences
(SEQ ID NOS: 1055 and 1056) of the Fc-TNF-α inhibitor fusion molecule
described in Example 4 hereinafter.

Figures 20A and 20B show the nucleotide and amino acid sequences
(SEQ ID NOS: 1057 and 1058) of the TNF-α inhibitor-Fc fusion molecule
described in Example 4 hereinafter.

Figures 21A and 21B show the nucleotide and amino acid sequences
(SEQ ID NOS: 1059 and 1060) of the Fc-IL-1 antagonist fusion molecule
described in Example 5 hereinafter.

Figures 22A and 22B show the nucleotide and amino acid sequences
(SEQ ID NOS: 1061 and 1062) of the IL-1 antagonist-Fc fusion molecule
described in Example 5 hereinafter.

Figures 23A, 23B, and 23C show the nucleotide and amino acid
sequences (SEQ ID NOS: 1063 and 1064) of the Fc-VEGF antagonist fusion
molecule described in Example 6 hereinafter.
Figures 24A and 24B show the nucleotide and amino acid sequences
(SEQ ID NOS: 1065 and 1066) of the VEGF antagonist-Fc fusion molecule
described in Example 6 hereinafter.

Figures 25A and 25B show the nucleotide and amino acid sequences
(SEQ ID NOS: 1067 and 1068) of the Fc-MMP inhibitor fusion molecule
described in Example 7 hereinafter.

Figures 26A and 26B show the nucleotide and amino acid sequences
(SEQ ID NOS: 1069 and 1070) of the MMP inhibitor-Fc fusion molecule
described in Example 7 hereinafter.

Detailed Description of the Invention

Definition of Terms 

The terms used throughout this specification are defined as follows, 
unless otherwise limited in specific instances. 

The term "comprising" means that a compound may include
additional amino acids on either or both of the N- or C- termini of the
given sequence. Of course, these additional amino acids should not
significantly interfere with the activity of the compound.
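
The sequence aspect of this definition can be pictured with a minimal Python sketch. The function names comprises and flanking_residues are hypothetical, and the sketch captures only the sequence criterion; whether the additional residues interfere with activity is an experimental question that a string comparison cannot answer.

```python
from typing import Optional, Tuple


def comprises(compound: str, given: str) -> bool:
    """True if `compound` contains `given` as a contiguous stretch, with any
    additional residues lying at the N-terminus, the C-terminus, or both."""
    return bool(given) and given in compound


def flanking_residues(compound: str, given: str) -> Optional[Tuple[str, str]]:
    """Return the (N-terminal, C-terminal) additions around the first
    occurrence of `given` in `compound`, or None if it is absent."""
    if not given:
        return None
    start = compound.find(given)
    if start < 0:
        return None
    return compound[:start], compound[start + len(given):]


if __name__ == "__main__":
    # Hypothetical compound: one extra residue before and two after the
    # TMP sequence given elsewhere in this specification.
    print(comprises("MIEGPTLRQWLAARAGG", "IEGPTLRQWLAARA"))
    print(flanking_residues("MIEGPTLRQWLAARAGG", "IEGPTLRQWLAARA"))
```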

The term "vehicle" refers to a molecule that prevents degradation
and/or increases half-life, reduces toxicity, reduces immunogenicity, or
increases biological activity of a therapeutic protein. Exemplary vehicles
include an Fc domain (which is preferred) as well as a linear polymer (e.g.,
polyethylene glycol (PEG), polylysine, dextran, etc.); a branched-chain
polymer (see, for example, U.S. Patent No. 4,289,872 to Denkenwalter et
al., issued September 15, 1981; 5,229,490 to Tam, issued July 20, 1993; WO
93/21259 by Frechet et al., published 28 October 1993); a lipid; a
cholesterol group (such as a steroid); a carbohydrate or oligosaccharide; or
any natural or synthetic protein, polypeptide or peptide that binds to a
salvage receptor. Vehicles are further described hereinafter.
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The term "native Fc" refers to molecule or sequence comprising the 
sequence of a non-antigen-binding fragment resulting from digestion of 
whole antibody, whether in monomeric or multimeric form. The original 
immunoglobulin source of the native Fc is preferably of human origin and 
5 may be any of the immunoglobulins, although IgGl and IgG2 are 

preferred. Native Fc's are made up of monomeric polypeptides that may 
be linked into dimeric or multimeric forms by covalent (i.e., disulfide 
bonds) and non-covalent association. The number of intermolecular 
disulfide bonds between monomeric subunits of native Fc molecules 

1 0 ranges from 1 to 4 depending on class (e.g., IgG, IgA, IgE) or subclass (e.g., 
IgGl, IgG2, IgG3, IgAl, IgGA2). One example of a native Fc is a disulfide- 
bonded dimer resulting from papain digestion of an IgG (see Ellison etal. 
(19821 Nucleic Acids Res. 10: 4071-9). The term "native Fc" as used herein 
is generic to the monomeric, dimeric, and multimeric forms. 

The term "Fc variant" refers to a molecule or sequence that is
modified from a native Fc but still comprises a binding site for the salvage
receptor, FcRn. International applications WO 97/34631 (published 25
September 1997) and WO 96/32478 describe exemplary Fc variants, as
well as interaction with the salvage receptor, and are hereby incorporated
by reference. Thus, the term "Fc variant" comprises a molecule or
sequence that is humanized from a non-human native Fc. Furthermore, a
native Fc comprises sites that may be removed because they provide
structural features or biological activity that are not required for the fusion
molecules of the present invention. Thus, the term "Fc variant" comprises
a molecule or sequence that lacks one or more native Fc sites or residues
that affect or are involved in (1) disulfide bond formation, (2)
incompatibility with a selected host cell, (3) N-terminal heterogeneity upon
expression in a selected host cell, (4) glycosylation, (5) interaction with
complement, (6) binding to an Fc receptor other than a salvage receptor, or
(7) antibody-dependent cellular cytotoxicity (ADCC). Fc variants are
described in further detail hereinafter.

The term "Fc domain" encompasses native Fc and Fc variant
molecules and sequences as defined above. As with Fc variants and native
Fc's, the term "Fc domain" includes molecules in monomeric or
multimeric form, whether digested from whole antibody or produced by
other means.

The term "multimer" as applied to Fc domains or molecules
comprising Fc domains refers to molecules having two or more
polypeptide chains associated covalently, noncovalently, or by both
covalent and non-covalent interactions. IgG molecules typically form
dimers; IgM, pentamers; IgD, dimers; and IgA, monomers, dimers,
trimers, or tetramers. Multimers may be formed by exploiting the
sequence and resulting activity of the native Ig source of the Fc or by
derivatizing (as defined below) such a native Fc.

The term "dimer" as applied to Fc domains or molecules
comprising Fc domains refers to molecules having two polypeptide chains
associated covalently or non-covalently. Thus, exemplary dimers within
the scope of this invention are as shown in Figure 2.

The terms "derivatizing" and "derivative" or "derivatized"
comprise processes and resulting compounds respectively in which (1) the
compound has a cyclic portion; for example, cross-linking between
cysteinyl residues within the compound; (2) the compound is cross-linked
or has a cross-linking site; for example, the compound has a cysteinyl
residue and thus forms cross-linked dimers in culture or in vivo; (3) one or
more peptidyl linkage is replaced by a non-peptidyl linkage; (4) the
N-terminus is replaced by -NRR¹, -NRC(O)R¹, -NRC(O)OR¹, -NRS(O)₂R¹,
-NHC(O)NHR, a succinimide group, or substituted or unsubstituted
benzyloxycarbonyl-NH-, wherein R and R¹ and the ring substituents are
as defined hereinafter; (5) the C-terminus is replaced by -C(O)R² or -NR³R⁴
wherein R², R³ and R⁴ are as defined hereinafter; and (6) compounds in
which individual amino acid moieties are modified through treatment
with agents capable of reacting with selected side chains or terminal
residues. Derivatives are further described hereinafter.

The term "peptide" refers to molecules of 2 to 40 amino acids, with
molecules of 3 to 20 amino acids preferred and those of 6 to 15 amino acids
most preferred. Exemplary peptides may be randomly generated by any
of the methods cited above, carried in a peptide library (e.g., a phage
display library), or derived by digestion of proteins.
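
The length ranges in the definition of "peptide" can be illustrated with a short sketch. The function name and the returned labels are hypothetical, and the sketch assumes a sequence written in one-letter codes with exactly one character per residue (so derivatized entries such as those containing "Sar" or "Nap" would need to be tokenized first).

```python
def classify_peptide_length(sequence: str) -> str:
    """Classify a one-letter-code sequence against the stated length ranges.

    Assumption: one character equals one amino acid residue.
    """
    n = len(sequence)
    if 6 <= n <= 15:
        return "most preferred"      # 6 to 15 amino acids
    if 3 <= n <= 20:
        return "preferred"           # 3 to 20 amino acids
    if 2 <= n <= 40:
        return "peptide"             # 2 to 40 amino acids
    return "outside definition"      # shorter than 2 or longer than 40


if __name__ == "__main__":
    # TKPR and FEWTPGYWQPY both appear in Table 4.
    for seq in ("TKPR", "FEWTPGYWQPY"):
        print(seq, len(seq), classify_peptide_length(seq))
```

The ranges are nested, so the narrowest range is tested first; a sequence that is "most preferred" is also within the broader definition.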

The term "randomized" as used to refer to peptide sequences refers
to fully random sequences (e.g., selected by phage display methods) and
sequences in which one or more residues of a naturally occurring molecule
is replaced by an amino acid residue not appearing in that position in the
naturally occurring molecule. Exemplary methods for identifying peptide
sequences include phage display, E. coli display, ribosome display,
RNA-peptide screening, chemical screening, and the like.

The term "pharmacologically active" means that a substance so
described is determined to have activity that affects a medical parameter
(e.g., blood pressure, blood cell count, cholesterol level) or disease state
(e.g., cancer, autoimmune disorders). Thus, pharmacologically active
peptides comprise agonistic or mimetic and antagonistic peptides as
defined below.

The terms "-mimetic peptide" and "-agonist peptide" refer to a
peptide having biological activity comparable to a protein (e.g., EPO, TPO,
G-CSF) that interacts with a protein of interest. These terms further
include peptides that indirectly mimic the activity of a protein of interest,
such as by potentiating the effects of the natural ligand of the protein of
interest; see, for example, the G-CSF-mimetic peptides listed in Tables 2
and 7. Thus, the term "EPO-mimetic peptide" comprises any peptides that
can be identified or derived as described in Wrighton et al. (1996), Science
273: 458-63, Naranda et al. (1999), Proc. Natl. Acad. Sci. USA 96: 7569-74,
or any other reference in Table 2 identified as having EPO-mimetic subject
matter. Those of ordinary skill in the art appreciate that each of these
references enables one to select different peptides than actually disclosed
therein by following the disclosed procedures with different peptide
libraries.

The term "TPO-mimetic peptide" comprises peptides that can be
identified or derived as described in Cwirla et al. (1997), Science 276:
1696-9, U.S. Pat. Nos. 5,869,451 and 5,932,946 and any other reference in
Table 2 identified as having TPO-mimetic subject matter, as well as the U.S.
patent application, "Thrombopoietic Compounds," filed on even date herewith
and hereby incorporated by reference. Those of ordinary skill in the art
appreciate that each of these references enables one to select different
peptides than actually disclosed therein by following the disclosed
procedures with different peptide libraries.

The term "G-CSF-mimetic peptide" comprises any peptides that
can be identified or described in Paukovits et al. (1984), Hoppe-Seylers Z.
Physiol. Chem. 365: 303-11 or any of the references in Table 2 identified as
having G-CSF-mimetic subject matter. Those of ordinary skill in the art
appreciate that each of these references enables one to select different
peptides than actually disclosed therein by following the disclosed
procedures with different peptide libraries.

The term "CTLA4-mimetic peptide" comprises any peptides that
can be identified or derived as described in Fukumoto et al. (1998), Nature
Biotech. 16: 267-70. Those of ordinary skill in the art appreciate that each of
these references enables one to select different peptides than actually
disclosed therein by following the disclosed procedures with different
peptide libraries.

The term "-antagonist peptide" or "inhibitor peptide" refers to a
peptide that blocks or in some way interferes with the biological activity of
the associated protein of interest, or has biological activity comparable to a
known antagonist or inhibitor of the associated protein of interest. Thus,
the term "TNF-antagonist peptide" comprises peptides that can be
identified or derived as described in Takasaki et al. (1997), Nature Biotech.
15: 1266-70 or any of the references in Table 2 identified as having
TNF-antagonistic subject matter. Those of ordinary skill in the art appreciate
that each of these references enables one to select different peptides than
actually disclosed therein by following the disclosed procedures with
different peptide libraries.

The terms "IL-1 antagonist" and "IL-1ra-mimetic peptide"
comprise peptides that inhibit or down-regulate activation of the IL-1
receptor by IL-1. IL-1 receptor activation results from formation of a
complex among IL-1, IL-1 receptor, and IL-1 receptor accessory protein.
IL-1 antagonist or IL-1ra-mimetic peptides bind to IL-1, IL-1 receptor, or
IL-1 receptor accessory protein and obstruct complex formation among
any two or three components of the complex. Exemplary IL-1 antagonist
or IL-1ra-mimetic peptides can be identified or derived as described in
U.S. Pat. Nos. 5,608,035, 5,786,331, 5,880,096, or any of the references in
Table 2 identified as having IL-1ra-mimetic or IL-1 antagonistic subject
matter. Those of ordinary skill in the art appreciate that each of these
references enables one to select different peptides than actually disclosed
therein by following the disclosed procedures with different peptide
libraries.

The term "VEGF-antagonist peptide" comprises peptides that can
be identified or derived as described in Fairbrother (1998), Biochem. 37:
17754-64, and in any of the references in Table 2 identified as having
VEGF-antagonistic subject matter. Those of ordinary skill in the art
appreciate that each of these references enables one to select different
peptides than actually disclosed therein by following the disclosed
procedures with different peptide libraries.

The term "MMP inhibitor peptide" comprises peptides that can be
identified or derived as described in Koivunen (1999), Nature Biotech. 17:
768-74 and in any of the references in Table 2 identified as having MMP
inhibitory subject matter. Those of ordinary skill in the art appreciate that
each of these references enables one to select different peptides than
actually disclosed therein by following the disclosed procedures with
different peptide libraries.

Additionally, physiologically acceptable salts of the compounds of
this invention are also encompassed herein. By "physiologically
acceptable salts" is meant any salts that are known or later discovered to
be pharmaceutically acceptable. Some specific examples are: acetate;
trifluoroacetate; hydrohalides, such as hydrochloride and hydrobromide;
sulfate; citrate; tartrate; glycolate; and oxalate.

Structure of compounds

In General. In the compositions of matter prepared in accordance
with this invention, the peptide may be attached to the vehicle through the
peptide's N-terminus or C-terminus. Thus, the vehicle-peptide molecules
of this invention may be described by the following formula I:

I
(X¹)a-F¹-(X²)b

wherein:

F¹ is a vehicle (preferably an Fc domain);

X¹ and X² are each independently selected from -(L¹)c-P¹,
-(L¹)c-P¹-(L²)d-P², -(L¹)c-P¹-(L²)d-P²-(L³)e-P³, and
-(L¹)c-P¹-(L²)d-P²-(L³)e-P³-(L⁴)f-P⁴;

P¹, P², P³, and P⁴ are each independently sequences of
pharmacologically active peptides;

L¹, L², L³, and L⁴ are each independently linkers; and

a, b, c, d, e, and f are each independently 0 or 1, provided that at
least one of a and b is 1.

Thus, compound I comprises preferred compounds of the formulae

II
X¹-F¹

and multimers thereof wherein F¹ is an Fc domain and is attached at the
C-terminus of X¹;

III
F¹-X²

and multimers thereof wherein F¹ is an Fc domain and is attached at the
N-terminus of X²;

IV
F¹-(L¹)c-P¹

and multimers thereof wherein F¹ is an Fc domain and is attached at the
N-terminus of -(L¹)c-P¹; and

V
F¹-(L¹)c-P¹-(L²)d-P²

and multimers thereof wherein F¹ is an Fc domain and is attached at the
N-terminus of -L¹-P¹-L²-P².
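
The way the indices a through f select among the arrangements described above can be sketched in code. This is an illustrative reading only: it assumes formula I has the general shape X1-F1-X2, with each X group holding one to four peptides, each optionally preceded by a linker on the side facing F1. The function names, the tuple representation, and the placeholder labels such as "Fc" and "L1" are hypothetical and are not taken from the specification.

```python
from typing import List, Optional, Tuple

# One unit of an X group: (linker, peptide). An empty linker string
# corresponds to the matching index c, d, e, or f being 0.
Unit = Tuple[str, str]


def _check_arm(arm: List[Unit]) -> None:
    """Validate one X group: one to four peptides, none of them empty."""
    if not 1 <= len(arm) <= 4:
        raise ValueError("an X group holds one to four peptides")
    if any(not peptide for _, peptide in arm):
        raise ValueError("every unit needs a peptide")


def assemble(fc: str,
             x1: Optional[List[Unit]] = None,
             x2: Optional[List[Unit]] = None) -> str:
    """Return a dash-separated N-to-C schematic of a vehicle-peptide molecule.

    x1 is the group on the N-terminal side of the vehicle (a = 1 if given);
    x2 is the group on the C-terminal side (b = 1 if given).
    Units are listed outward from the vehicle, so x1 is written in reverse.
    """
    if x1 is None and x2 is None:
        raise ValueError("at least one of a and b must be 1")
    parts: List[str] = []
    if x1 is not None:
        _check_arm(x1)
        for linker, peptide in reversed(x1):
            parts.append(peptide)
            if linker:
                parts.append(linker)
    parts.append(fc)
    if x2 is not None:
        _check_arm(x2)
        for linker, peptide in x2:
            if linker:
                parts.append(linker)
            parts.append(peptide)
    return "-".join(parts)


if __name__ == "__main__":
    print(assemble("Fc", x2=[("L1", "P1")]))                 # formula IV shape
    print(assemble("Fc", x2=[("L1", "P1"), ("L2", "P2")]))   # formula V shape
    print(assemble("Fc", x1=[("", "P1")]))                   # formula II shape, no linker
```

Real sequences can be substituted for the placeholder labels; the sketch only tracks ordering and does not model multimers or disulfide bonding.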

Peptides. Any number of peptides may be used in conjunction with
the present invention. Of particular interest are peptides that mimic the
activity of EPO, TPO, growth hormone, G-CSF, GM-CSF, IL-1ra, leptin,
CTLA4, TRAIL, TGF-α, and TGF-β. Peptide antagonists are also of
interest, particularly those antagonistic to the activity of TNF, leptin, any
of the interleukins (IL-1, 2, 3, ...), and proteins involved in complement
activation (e.g., C3b). Targeting peptides are also of interest, including
tumor-homing peptides, membrane-transporting peptides, and the like.
All of these classes of peptides may be discovered by methods described in
the references cited in this specification and other references.

Phage display, in particular, is useful in generating peptides for use
in the present invention. It has been stated that affinity selection from
libraries of random peptides can be used to identify peptide ligands for
any site of any gene product. Dedman et al. (1993), J. Biol. Chem. 268:
23025-30. Phage display is particularly well suited for identifying peptides
that bind to such proteins of interest as cell surface receptors or any
proteins having linear epitopes. Wilson et al. (1998), Can. J. Microbiol. 44:
313-29; Kay et al. (1998), Drug Disc. Today 3: 370-8. Such proteins are
extensively reviewed in Herz et al. (1997), J. Receptor & Signal
Transduction Res. 17(5): 671-776, which is hereby incorporated by
reference. Such proteins of interest are preferred for use in this invention.

A particularly preferred group of peptides are those that bind to
cytokine receptors. Cytokines have recently been classified according to
their receptor code. See Inglot (1997), Archivum Immunologiae et
Therapiae Experimentalis 45: 353-7, which is hereby incorporated by
reference. Among these receptors, most preferred are the CKRs (family I in
Table 3). The receptor classification appears in Table 3.

Table 3—Cytokine Receptors Classified by Receptor Code

Cytokines (ligands)                                   Receptor Type
family                subfamily                       family                subfamily

I. Hematopoietic      1. IL-2, IL-4, IL-7, IL-9,      I. Cytokine R (CKR)   1. shared γCr
   cytokines             IL-13, IL-15
                      2. IL-3, IL-5, GM-CSF                                 2. shared GP 140 βR
                      3. IL-6, IL-11, IL-12, LIF,                           3. shared RP 130
                         OSM, CNTF, leptin (OB)
                      4. G-CSF, EPO, TPO, PRL, GH                           4. "single chain" R
                      5. IL-17, HVS-IL-17                                   5. other R

II. IL-10 ligands     IL-10, BCRF-1, HSV-IL-10        II. IL-10 R

III. Interferons      1. IFN-α1, α2, α4, m, t, IFN-β  III. Interferon R     1. IFNAR
                      2. IFN-γ                                              2. IFNGR

IV. IL-1 ligands      1. IL-1α, IL-1β, IL-1Ra         IV. IL-1R

V. TNF ligands        1. TNF-α, TNF-β (LT), FAS1,     V. NGF/TNF R
                         CD40L, CD30L, CD27L

VI. Chemokines        1. α chemokines: IL-8, GRO      VI. Chemokine R       1. CXCR
                         α, β, γ, IF-10, PF-4, SDF-1
                      2. β chemokines: MIP1α, MIP1β,                        2. CCR
                         MCP-1,2,3,4, RANTES, eotaxin
                      3. γ chemokines: lymphotactin                         3. CR
                                                                            4. DARC

< aher I FN type I subtypes remain ""^^^J^^gnalir^g molecules for the 
interferons do not, possess "--?6 and IL-18 have been 

• TNF receptors use '^^S^SC^Vc^^ NGF/TNF 
protein-coupled, seven transme rX*or that can bind several different 
transduction events remain unclear. 
VII. Growth factors   1.1 SCF, M-CSF, PDGF-AA, AB,    VII. RKF              1. TK sub-family
                          BB, FLT-3L, VEGF, SSV-PDGF                           1.1 IgTK III R
                      1.2 FGFα, FGFβ                                           1.2 IgTK IV R
                      1.3 EGF, TGF-α, W-F19                                    1.3 Cysteine-rich TK-I
                          (EGF-like)
                      1.4 IGF-I, IGF-II, Insulin                               1.4 Cysteine rich TK-II
                      1.5 NGF, BDNF, NT-3, NT-4                                1.5 Cysteine knot TK V
                      2. TGF-β1, β2, β3                                     2. STK subfamily


Exemplary peptides for this invention appear in Tables 4 through
20 below. These peptides may be prepared by methods disclosed in the
art. Single letter amino acid abbreviations are used. The X in these
sequences (and throughout this specification, unless specified otherwise in
a particular instance) means that any of the 20 naturally occurring amino
acid residues may be present. Any of these peptides may be linked in
tandem (i.e., sequentially), with or without linkers, and a few
tandem-linked examples are provided in the table. Linkers are listed as "A"
and may be any of the linkers described herein. Tandem repeats and linkers
are shown separated by dashes for clarity. Any peptide containing a
cysteinyl residue may be cross-linked with another Cys-containing
peptide, either or both of which may be linked to a vehicle. A few
cross-linked examples are provided in the table. Any peptide having more than
one Cys residue may form an intrapeptide disulfide bond, as well; see, for
example, EPO-mimetic peptides in Table 5. A few examples of
intrapeptide disulfide-bonded peptides are specified in the table. Any of
these peptides may be derivatized as described herein, and a few
derivatized examples are provided in the table. Derivatized peptides in
the tables are exemplary rather than limiting, as the associated
underivatized peptides may be employed in this invention, as well. For
derivatives in which the carboxyl terminus may be capped with an amino
group, the capping amino group is shown as -NH2. For derivatives in
which amino acid residues are substituted by moieties other than amino
acid residues, the substitutions are denoted by a, which signifies any of
the moieties described in Bhatnagar et al. (1996), J. Med. Chem. 39: 3814-9
and Cuthbertson et al. (1997), J. Med. Chem. 40: 2876-82, which are
incorporated by reference. The J substituent and the Z substituents (Z,, Z v
. . .Z J are as defined in U.S. Pat. Nos. 5,608,035, 5,786,331, and 5,880,096,
which are incorporated by reference. For the EPO-mimetic sequences
(Table 5), the substituents X, through X n and the integer "n" are as defined
in WO 96/40772, which is incorporated by reference. The substituents
and "+" are as defined in Sparks et al. (1996), Proc. Natl. Acad. Sci. 93:
1540-4, which is hereby incorporated by reference. X„ X,, X y and X, are as
defined in U.S. Pat. No. 5,773,569, which is hereby incorporated by
reference, except that: for integrin-binding peptides, X1, X2, X3, X4, X5,
X6, X7, and X8 are as defined in International applications WO 95/14714,
published June 1, 1995 and WO 97/08203, published March 6, 1997, which
are also incorporated by reference; and for VIP-mimetic peptides, X,, X, 1 ,
X " X,, X„ X„ X y X 6 and Z and the integers m and n are as defined in WO
97/40070, published October 30, 1997, which is also incorporated by
reference. Xaa and Yaa below are as defined in WO 98/09985, published
March 12, 1998, which is incorporated by reference. AA1, AA2, AB1, AB2,
and AC are as defined in International application WO 98/53842,
published December 3, 1998, which is incorporated by reference. X1, X2, X3,
and X4 in Table 17 only are as defined in European application EP 0 911
393, published April 28, 1999. Residues appearing in boldface are
D-amino acids. All peptides are linked through peptide bonds unless
otherwise noted. Abbreviations are listed at the end of this specification. In
the "SEQ ID NO." column, "NR" means that no sequence listing is required
for the given sequence.
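
Two of the conventions stated above lend themselves to a brief sketch: the use of X for any of the 20 naturally occurring residues, and the observation that a peptide with more than one Cys residue may form an intrapeptide disulfide bond. The function names below are hypothetical, and the sketch assumes plain one-letter sequences without derivatization symbols or linkers.

```python
import re

# One-letter codes for the 20 naturally occurring amino acids.
NATURAL_RESIDUES = "ACDEFGHIKLMNPQRSTVWY"


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a consensus pattern in which X means any natural residue."""
    pieces = []
    for symbol in pattern:
        if symbol == "X":
            pieces.append("[" + NATURAL_RESIDUES + "]")
        elif symbol in NATURAL_RESIDUES:
            pieces.append(symbol)
        else:
            raise ValueError("unsupported symbol: " + repr(symbol))
    return re.compile("".join(pieces))


def matches_pattern(pattern: str, sequence: str) -> bool:
    """True if the whole sequence fits the pattern position by position."""
    return pattern_to_regex(pattern).fullmatch(sequence) is not None


def contains_pattern(pattern: str, sequence: str) -> bool:
    """True if some stretch of the sequence fits the pattern."""
    return pattern_to_regex(pattern).search(sequence) is not None


def may_form_intrapeptide_disulfide(sequence: str) -> bool:
    """More than one Cys residue is the stated precondition."""
    return sequence.count("C") > 1


if __name__ == "__main__":
    consensus = "YXCXXGPXTWXCXP"           # listed in Table 5
    candidate = "GGTYSCHFGPLTWVCKPQGG"     # listed in Table 5
    print(contains_pattern(consensus, candidate))
    print(may_form_intrapeptide_disulfide(candidate))
```

The Cys count is only a necessary condition as stated in the text; whether a disulfide actually forms depends on factors the sketch does not model.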



Table 4— IL-1 antagonist peptide sequences 



Sequence/structure


SEQ ID NO:




212 




907 


7 Yf\7 V7 w 


908 


y y r\y V7 77 '■■ 


909 


7 7 7 r\7 V7 7 7 


910 


7777777777"77"7^ v/t —r -» i 


917 


7 M7 .77777777 


979 


1 AIM VobrtW T PYYWQPYALPL 


213 


oW 1 UYb YWUrYALrloGL 


214 


1 1 rr 1 WfcfcoNAYYWQPYALPL 


215 


PMTVCDM\A/AnCMV\A//"\DVAI Dl 
CIM 1 TorlAIWAUoMYVVVjrYALrL 


216 


ovvacUnlMrW 1 otYWUr YALrL 


217 


DGYDRWRQSGERYWQPYALPL 


Zlo 


FEWTPGYWQPY 


219 


FEWTPGYWQHY 


220 


FEWTPGWYQJY 


221 


AcFEWTPGWYQJY 


222 


FEWTPGWpYQJY 


223 


FAWTPGYWQJY 


224 i 


FEWAPGYWQJY 


225 


FEWVPGYWQJY 


226 


FEWTPGYWQJY 


227 


AcFEWTPGYWQJY 


228 


FEWTPaWYQJY 


229 


FEWTPSarWYQJY 


230 


FEWTPGYYQPY 


231 


FEWTPGWWQPY 


232 


FEWTPNYWQPY ! 


233 


FEWTPvYWQJY 


234 


FEWTPecGYWQJY 


235 


FEWTPAibYWQJY 


236 


FEWTSarGYWQJY 


237 


FEWTPGYWQPY 


238 


FEWTPGYWQHY 


239 


FEWTPGWYQJY 


240 
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AcFEWTPGWYQJY 


241 


FEWTPGW-pY-QJY , 


242 


FAWTPGYWQJY 


243 


FEWAPGYWQJY 


244 


FEWVPGYWQJY 


245 


FEWTPGYWQJY 


246 


AcFEWTPGYWQJY 


247 


FEWTPAWYQJY 


248 


FFWTPSarWYQJY 


249 


FEWTPGYYQPY . -. 


250 


FEWTPGWWQPY 


251 


FEWTPNYWQPY 


252 


FEWTPVYWQJY 


253 


FEWTPecGYWQJY 


254 


FEWTPAibYWQJY 


255 


F EWTSarGYWQJ Y 


256 


FEWTPGYWQPYALPL 


257 


1 NapEWTPGYYQJY 


258 


YEWTPGYYQJY 


259 


FEWVPGYYQJY 


260 


FEWTPSYYQJY . 


261 


FEWTPNYYQJY 


262 


TKPR 


263 


RKSSK 


264 | 


RKQDK 


265 


NRKQDK 


266 


RKQDKR 


267 


ENRKQDKRF 


268 


VTKFYF 


269 


VTKFY 


270 


VTDFY 


271 1 


SHLYWQPYSVQ 


671 


TLVYWQPYSLQT 


672 


RGDYWQPYSVQS 


673 


VHVYWQPYSVQT 


674 


RLVYWQPYSVQT 


675 


RRVWFQPYSLQS 


676 


KIMVYWQPYSIQT 


677 


RWFWOPYSVQT 


678 


TFVYWQPYALPL . 


679 


TLVYWQPYSIQR 


680 


RLVYWQPYSVQR 


681 


SPVFWQPYSIQI . 


682 


WIEWWQPYSVQS 


683 


SLIYWQPYSLQM 


" 684 


TRLYWQPYSVQR . — 


: ~ 685 


RCDYWQPYSVQT 


686 


MRVFWQPYSVQN , _ 


687 


KIVYWQPYSVOT . 


n 688 


RHLYWQPYSVQR 


689 
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ALVWWQPYSEQI 


690 


SRVWFQPYSLQS 


691 


WEQPYALPLE 


692 


QLVWWQPYSVQR 


693 


DLRYWQPYSVQV 


694 


ELVWWQPYSLQL 


695 


DLVWWQPYSVQW 


696 


NGNYWQPYSFQV 


697 


ELVYWQPYSIQR 


698 


ELMYWQPYSVQE 


699 


NLLYWQPYSMQD 


700 


GYEWYQPYSVQR 


701 


SRVWYQPYSVQR 


702 


LSEQYQPYSVQR 


703 


GGGWWQPYSVQR 


704 


VGRWYQPYSVQR 


705 


VHVYWQPYSVQR 


706 


QARWYQPYSVQR 


707 


VHVYWQPYSVQT 


708 


RSVYWQPYSVQR 


709 


TRVWFQPYSVQR 


710 


GRIWFQPYSVQR 


711 


GRVWFQPYSVQR 


712 


ARTWYQPYSVQR 


713 


ARVWWQPYSVQM 


714 


RLMFYQPYSVQR 


715 


ESMWYQPYSVQR 


716 


HFGWWQPYSVHM 


717 


ARFWWQPYSVQR 


718 


RLVYWQ PYAPIY 


719 I 


RLVYWQ PYSYQT 


720 


RLVYWQ PYSLPI 


721 


RLVYWQ PYSVQA 


722 


SRVWYQ PYAKGL 


723 


SRVWYQ PYAQGL 


724 


SRVWYQ PYAMPL 


725 


SRVWYQ PYSVQA 


726 


SRVWYQ PYSLGL 


727 


SRVWYQ PYAREL 


728 


SRVWYQ PYSRQP 


729 


SRVWYQ PYFVQP 


730 


EYEWYQ PYALPL 


731 


IPEYWQ PYALPL 


732 


SRIWWQ PYALPL 


733 


DPLFWQ PYALPL 


734 


SRQWVQ PYALPL 


735 ! 


IRSWWQ PYALPL 


736 


RGYWQ PYALPL 


737 


RLLWVQ PYALPL 


738 


EYRWFQ PYALPL 


739 
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DAYWVQ PYALPL 


740 


WSGYFQ PYALPL 


741 


NIEFWQ PYALPL 


742 


TRDWVQ PYALPL 


743 


DSSWYQ PYALPL 


744 


IGNWYQ PYALPL 


745 


NLRWDQ PYALPL 


746 


LPEFWQ PYALPL 


747 


DSYWWQ PYALPL 


748 


RSQYYQ PYALPL 


749 


ARFWLQ PYALPL 


750 


NSYFWQ PYALPL 


751 


PFMYWOPYSVQR 


752 


AHLFWQPYSVQR 


753 


WWQPYALPL 


754 


YYQPYALPL 


755 


YFQPYALGL 


756 


YWYOPYA' Pi 


757 


RWWQPYATPL 


758 


GWYQPYALGF 


759 


YWYOPVAI RL 


760 


IWYQPYAMPL 


761 


SNMQPYQRLS 


762 


TFVYWQPY AVGLPAAETACN 


763 


TFVYWQPY SVQMTITGKVTM 


764 


TFVYWQPY SSHXXVPXGFPL 


765 


TFVYWQPY YGNPQWAIHVRH 


766 | 


TFVYWQPY VLLELPEGAVRA 


767 


TFVYWQPY VDYVWPIPIAQV 


768 


GWYQPYVDGWR 


769 


RWEQPYVKDGWS 


770 


EWYQPYALGWAR 


771 


GWWQPYARGL 


772 


LFEQPYAKALGL 


773 


GWEQPYARGLAG 


774 


AWVQPYATPLDE 


775 


MWYQPYSSQPAE 


776 


GWTQPYSQQGEV 


777 


DWFQPYSIQSDE 


778 


PWIQPYARGFG 


779 


RPLYWQPYSVQV 


/ Ow 


TLIYWQPYSVQI 


781 


R FDYWQP YSDQT 


782 


WHQFVQPYALPL 


783 


EWDS VYWQPYSVQ TLLR 


784 


* WEQN VYWQPYSVQ SFAD 


785 


SDV VYWQPYSVQ SLEM 


" 786 


YYDG VYWQPYSVQ VMPA 


787 


SDIWYQ PYALPL 


788 


QRIWWQ PYALPL 


789 



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onlWWU rYALPL 


1 790 


noLYWU rYALPL 


| 791 


1 IIWcU rYALPL 


f 792 


vvc I WTU rYALPL 


1 793 


or UWtU rYALPL 


1 794 


onlWOU rYALPL 


795 


| EiMrwu rYALPL 


796 


| UT VVVUU rYALPL 


! 797 


MULLVU WYUPYALPL 


j 798 


I oorxvlL WYUPYALPL 


| 799 


nUuAIMI WYUPYALPL 


( 800 


lacivaUfcP WYUPYALPL 


! 801 


I oULtnl WYUPYALPL 


! 802 


1 1 WVnfc WYUPYALPL 


I 803 


KKGSTQ WYUPYALPL 


I 804 


LUAKMIn WYUPYALPL 


1 805 


tPHbUK WYUPYALPL 


806 


VKUKWR WYUPYALPL 


807 


LKRHDV WYUPYALPL 


1 808 


RSTASI WYQPYALPL J 


1 809 


1 fcSKEDQ WYQPYALPL 


( 810 


EGLTMK WYQPYALPL 


1 811 


I EGSREG WYQPYALPL 


| 812 


1 VltWWQ PYALPL 


! 813 


1 VWYWEU PYALPL 


814 


ASEWWQ PYALPL " " " 


815 


FYEWWQ PYALPL 


816 


EGWWVQ PYALPL 


817 


WGEWLQ PYALPL " " 


818 


UYVWEQ PYALPL j 


819 


I AMTWWQ PYALPL | 


820 


PIEWFQ PYALPL " "J 


821 


WLAWEQ PYALPL f 


822 


VMfcWWU PYALPL I 


823 


tnlvlWU PYALPL | 


824 


I i>*AA¥Vaa rYALPL [ 


825 


vvvalMWYU rYALPL \ 


826 


f l LTWCU rYALPL 1 


827 


i vvvnvvcu rYMLrL 1 


828 


| ULWIU rinLrL 1 


829 


SRIWXX PYALPL f 




SDIWYQ PYALPL p 


831 


WGYYXX PYALPL f 


832 


TSGWYQ PYALPL r 


833 


IVHPYXX PYALPL r 


834 


EHSYFQ PYALPL T* 


835 


XXIWYQ PYALPL (" 


836 


AQLHSQ PYALPL r" 


837 


WANWFQ PYALPL T 


838 


SRLYSQ PYALPL T 


839 
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GVTFSQ PYALPl" 
SIVWSQ PYALPL 



SRDLVQ PYALPL 

HWGH VYWQPYSVQ DPLG 



SWHS VYWQPYSVQ SVPb 
WRDS VYWQPYSVQ PESA 
TWDA VYWQPYSVQ KWLU 



TPPW VYWQPYSVQ SLDP" 
YWSS VYWQPYSVQ SVHS 



YWYQPYALGL 



YWYQPYALPL 



842 
843 



847 
848 



849 
850 



EWI QPYATGL 



NWE QPYAKPL 



851 
852 
853 
854 
855 



AFYQPYALPL 
FLYQPYALPL 
VCKQPYLEWC 



FTPFTWEESNAYYWQPYALPL 



QGWLTWQDSVnMYWQPYALPL 
FSEAGYTWPENTYWQPYALPLT 
-rcQPftrtl nWAKIYWQPYALPL 



p^vnoWRDSGERYWOPYALPL 



tam v/qqffWTPGYWQPYALPL 



857 
858 
859 



861 
862 



cy«i= nHMFWTSE YWQPYALPL 



K«Ki nnTRFVSTFP YWQPYALPL 



SWSEAFEQPRNL YWQPYALPL 



OY? F pgAI NnwG YWQPYALPL 



NGDWATADWSNY YWQPYALPL 



THr>EHI YWQPYALPL 



Ml cutvttWTPG YWQPYALPL 



W SDPLTRDADL YWQPYALPL 
RDAFTTQDSQAM YWQPYALPL 



RDDAAWRTDSLT YWQPYALPL 
aiipn i VRWfiEM YWQPYALPL 



FNT vc PM\A/AnRM YWQPYALPL 



M Mnr>TQFVRTFP YWQPYALPL 



ey^c nuMFWTSE YWQPYALPL 



O TPFTWEESNAY YWQPYALKL 
ENPFTWQESNAY YWQPYALPL 
VTPFTWEDSNVF YWQPYALPLT 
r[| p cTXA/FOfiN AY YWQPYALrL 
QAPLTWQESAAY YWQPYALPL 



EP TFTWEESKAT YWQPYALPL 
ttti tmifPSNAY YWQPYALKL 
FfiPLTWEESSAL YWQPYALPL 



864 
865 



866 
867 



868 
869 
870 



871 



872 



873 



874 



875 



881 
882 



883 
884 
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STSFTWEESNAY YWQPYALPL 


890 


DSTFTWEESNAY YWQPYALPL 


891 


YIPFTWEESNAY YWQPYALPL 


892 


QTAFTWEESNAY YWQPYALPL 


893 


PTI PTWPPQMAT V\A/HDVAI Dl 
C I Lr 1 W t toINM 1 i WUr YMLr L 


on a 

894 


VQQPTVUPPOMAV V\A/r*DVAI Dl 
VoOr 1 WttolNMY YWUr YMLrL 


895 


OPVAI PI 
Ur YMLrL 


896 


Dw 1 KlonDVA IVAI Ol 
ry-1 -INapr YUJ YMLr L 


897 


TANV/QQPPWTD^ VIA/ADVA 1 Dl 
IMIVVoorCVV 1 rva YWUr YMLrL 


ana 

898 


PP\A/TDftV\A/r*DVAI Dl 
rCW 1 ro Y WUr YMLrL 


899 


P P KMT DfcVWrS IVAI Dl 

rtw I rbTWUJ YMLrL 


900 


rfcWI rvaYYUJYALrL 


901 


1 1 rr I WctoNAYYWUrYALrL 


902 


CT1A/CCCMA\A/\I/A IV/ A 1 m 

r 1 WccSNAYYWQJYALPL 


903 


ADVL YWQPYA PVTLWV 


904 


GDVAc YWQPYA LPLTSL 


905 


SWTDYG YWQPYA LPISGL 


906 


FEWTPGYWQPYALPL 


911 


FEWTPGYWQJYALPL 


912 


FEWTPGWYQPYALPL 


913 


FEWTPGWYQJYALPL 


914 


FEWTPGYYQPYALPL 


915 


r~r~i a /"T* , n/"*\x\..//"\ iv/ai m 

FEWTPGYYQJYALPL 


916 


TANVSSFEWTPGYWQPYALPL 


918 


SWTDYGYWQPYALPISGL 


919 


ETPFTWEESNAYYWQPYALPL 


920 


ENTYSPNWADSM YWQPYALPL 


921 


ovvatUrlNrW 1 ocYWQPYALrL 


922 


UbYUHWHUobbnYWUrYALrL 


923 


rtW rrCa YWQPYALPL 


924 


PC\AfrD^V\A//"7DV 

rtW 1 roY WUr Y 


925 


-C\MTDfl\/\Air\ IV 

rtW 1 rvaYWUuY 


926 


"\A/TPr2V\A/nDV 
cw I rva Y WUr Y 


927 


PP\A/TDf2\A/VO IV 

rcw I ruWYUJY 


928 


A CVA/TD/"2 V\A/A IV 

MtW 1 rva YWUJY 


929 


FAWTPrtV\A/n IV 
rMWIrVjYWUJY 


930 


PEATDf5V\A/A IV 
rtMl rvaYWUJY 


931 


PP\A/ADf5V\A/n IV 

rCWMru YWUJY j 


932 


PP\A/TA^V\A/A IV 
rtw IMoYWUJY 


933 


PClA/TDAV\A/0 IV 

rtW 1 rMYWUJY \ 


934 


FEWTPGAWQJY 


OIK i 


FEWTPGYAQJY 


936 


FEWTPGYWQJA 


937 


FEWTGGYWQJY 


938 


FEWTPGYWQJY 


939 


FEWTJGYWQJY 


940 


FEWTPecGYWQJY 


941 


FEWTPAibYWQJY 


942 


FEWTPSarWYQJY 


943 


FEWTSarGYWQJY 


944 



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mm* a ivnk IVAIIA IV/ 

FEWTPNYWQJY 


945 


FEWTPVYWQJY 


946 


FEWTVPYWQJY , _ 


947 


a j iii i i — i j — i in/A IV/ 

AcFEWTPGWYQJY - 


948 


AcFEWTPGYWQJY - 


949 


iNap-EWTPGYYQJY 


950 


YEWTPGYYQJY ■ 


951 


FEWVPGYYQJY 


952 


FEWTPGYYQJY 


953 


FEWTPsYYQJY 


954 


FEWTPnYYQJY . 


955 


SHLY-Nap-QPYSVQM — 


956 


TLVY-Nap-QPYSLQT — 


957 


RGDY-Nap-QPYSVQb 


958 


NMVY-Nap-QPYSIQT . . 


959 


VYWQPYSVQ . 


960 


VY-Nao-QPYSVQ . 


961 


TFVYWQJYALPL . - 


962 


FEWTPGYYQJ-Bpa 


963 


XaaFEWTPGYYQJ-Bpa . 


964 


FEWTPGY-Bpa-QJY . 


965 


AcFEWTPGY-Bpa-QJY . . 


966 


FEWTPG-Bpa-YQJY — 


967 


AcFEWTPG-Bpa-YQJY 


968 


AcFE-Bpa-TPGYYQJY - 


969 


AcFE-Bpa-TPGYYQJY , 


970 


Bpa-EWTPGYYQJY , 1 


971 


AcBpa-EWTPGYYQJY _ 


972 


VYWQPYSVQ 


973 


RLVYWQPYSVQR , 


974 


RLVY-Nap-QPYSVQR 


i 975 


RLDYWQPYSVQR 


976 


RLVWFQPYSVQR _ 


977 


RLVYWQPYSIQR _ 


978 


DNSSWYDSFLL 


980 


DMTAWYESFLA 


981 


DNTAWYENFLL — 


982 


"PARE DNTAWYDSFLI WU 


983 


TSEY DNTTWYEKFLA SO 


984 


SQIP DNTAWYQSFLL HG 


985 


SPFI DNTAWYENFLL TY 

EQIY DNTAWYDHFLL SY 


986 
987 
988 


TPFI DNTAWYENFLL TY - 

TYTY DNTAWYERFLM SY 

TMTQ DNTAWYENFLL SY — 


989 
990 

- - 9«h 


Tl DNTAWYANLVQ TYPQ _ 

Tl DNTAWYERFLA QYPD 


992 


HI DNTAWYENFLL TYTP . — 

SQ DNTAWYENFLL SYKA - 

Ql DNTAWYPRPl L OYNA 


993 
994 
995 
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NU UN 1 AW YfcorLL UYIM ! 




Tl nMTAlA/VCklCTI 1 MUM! 

1 1 UN 1 AW Y fcNr LL IMnNL 


007 

77/ 


MY UNIAWYtnrLU UbWtl 


770 


1 1 rr 1 WfcfcoN AY YWUr YALrL 


ooo i 


VIDCTU/CCCM A WU/ADVAI Dl 

Ylrr 1 WfcfcoNAY YWUr YALrL 


i Ann 
1UUU 


UCaYUnWHUovatnYWUr YALrL 


inni 
iuUl 


pY-lNap-pY-UJYALrL 


1UUZ 


TAMWOGCC\A/TD/*V\A/riDVAI Dl 

1 AN VoorfcW 1 rvaYWUr YALrL 




FfcWTPGYWQJYALPL 


1 AAA 
1UU4 


FEWTPGYWQPYALPLSD 


1 AAC 


FEWTPGYYQJYALPL 


lOUo 


FEwTPGYWQJY 


1 AA*7 


AcFEWTPGYWQJY 


1 AAO 

lOUo 


AcFEwTPGWYQJY 


1 AAA 


AcFEWTPGYYQJY 


1 A1 A 

1010 


AcFEWTPaYWQJY 


1 A1 1 

1011 


AcFEWTPaWYQJY 


1 A1 O 

1012 


AcFEWTPaYYQJY 


1013 


COA/TD/^Wn IVAI Dl 

rtW 1 rVaYYUJ YALrL 


mid 


FEWTPGYWQJYALPL 


1015 


FEWTPGWYQJYALPL 


1016 


TANVSSFEWTPGYWQPYALPL 


1017 


AcFEWTPGYWQJY 


1018 


AcFEWTPGWYQJY 


1019 


AcFEWTPGYYQJY 


1020 


AcFEWTPAYWQJY 


1021 


AcFEWTPAWYQJY 


1022 


AcFEWTPAYYQJY 


1023 
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Table 5— EPO-mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


YXCXXGPXTWXCXP 


83 


YXCXXGPXTWXCXP-YXCXXGPXTWXCXP 


84 


YXCXXGPXTWXCXP-A-YXCXXGPXTWXCXP 


85 


YXCXXGPXTWXCXP-A- ^ 

\ 
/ 

YXCXXGPXTWXCXP-A- (a-amine) 


86 
86 


GGTYSCHFGPLTWVCKPQGG 


87 


GGDYHCRMGPLTWVCKPLGG 


88 


GGVYACRMGPITWVCSPLGG 


89 


VGNYMCHFGPITWVCRPGGG 


90 


GGLYLCRFGPVTWDCGYKGG 


91 


GGTYSCHFGPLTWVCKPQGG- 
GGTYSCHFGPLTWVCKPQGG 


92 


GGTYSCHFGPLTWVCKPQGG -A- 
GGTYSCHFGPLTWVCKPQGG 


93 


GGTYSCHFGPLTWVCKPQGGSSK 


94 


GGTYSCHFGPLTWVCKPQGGSSK- 
GGTYSCHFGPLTWVCKPQGGSSK 


95 


GGTYSCHFGPLTWVCKPQGGSSK-A- 
GGTYSCHFGPLTWVCKPQGGSSK 


96 


GGTYSCHFGPLTWVCKPQGGSS 

X 

\ 
/ 

GGTYSCHFGPLTWVCKPQGGSS (a-amine) 


97 


GGTYSCHFGPLTWVCKPQGGSSK(-A-biotin) 


98 


r,X,X.GPX n TWX T C 


421 


GGTYSCHGPLTWVCKPQGG 


422 


VGNYMAHMGPITWVCRPGG 


423 


GGPHHVYACRMGPLTWIC 


. 424 


GGTYSCHFGPLTWVCKPQ 


425 


GGLYACHMGPMTWVCQPLRG 


426 


TIAQYICYMGPETWECRPSPKA 


1 427 


YSCHFGPLTWVCK 


428 


YCHFGPLTWVC 


429 


XXX.GPXJWXX 


124 


YXXXXGPXJWXX 


461 
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^YXAXXGPXJWXXXJC-X,, 


419 


X,YX,CXXGPX.7WX,CXXX 


420 


GGLYLCRFGPVTWDCGYKGG 


1024 


GGTYSCHFGPLTWVCKPQGG 


1025 


GGDYHCRMGPLTWVCKPLGG 


1026 


VGNYMCHFGPITWVCRPGGG 


1029 


GGVYACRMGPITWVCSPLGG 
VGNYMAHMGPITWVCRPGG 
GGTYSCHFGPLTWVCKPQ 
GGLYACHMGPMTWVCQPLRG 


1030 
1035 
1036 
1037 


TIAQYICYMGPETWECRPSPKA 

YSCHFGPLTWVCK ■ 

YCHFGPLTWVC 

SCHFGPLTWVCK 

AXJAXXGPXJWXX 


1038 1 

1039 

1040 

1041 

1042 
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Table 6-TPO-mimetic peptide sequences 




1egptlrqwlaara-a-iegptlhuulmARA 

iPftPTI-RQWLAA P A A-'cftPTlRQCLAARA 
VRDQIXXXL 
TLREWL 
GRVRDQVAGW 
GRVKPQ1AQL 



GVRDQVSWAL 
ESVREQVMKY 
SVRSQISASL 



gvretvyrhm 

gvrevivmhml 

grvrdqiwaaT 



AGVRDQILIWL 



GRVRDQIMLSL 

GRVRDQKX)X~ 
OTLRQWLQGC 
CTLQEFLEGC 
CTRTEWLHGC 
nTLREWLHGGFC 
riTl REWVFAGLC 
' r. n RQWLILLGMC 



' r.TLAEFLASGVEQC 
~CSj QEFLSHG GYVC 
"fori REFLDPT I'AVC 
ViTLKEWLVSHEVWC 
r,TLREWL(XL,C 
REGPTLRQWM 
FGPTLRQWLA 
ERGPFWAKAC 
REGPRCVMWM 
Y.GTEGPTLSTWLDC. 



43 



48 

49 



50 
51 



52 
53 
54 
55 
56-60 
6f 



Ho 
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CEQDGPTLLEWLKC 


66 


CELVGPSLMSWLTC 


67 


CLTGPFVTQWLYEC 


68 


CRAGPTLLEWLTLC 


69 


LrMUvar 1 LntWIorU 


70 


C(XLEGPTLREWL(X),,C 


71-74 


GGCTLREWLHGGFCGG 


75 


GGCADGPTLREWISFCGG 


76 


GNADGPTLRQWLEGRRPKN 


77 


LAIEGPTLRQWLHGNGRDT 


78 


HGRVGPTLREWKTQVATKK 


79 


TIKGPTLRQWLKSREHTS 


80 


ISDGPTLKEWLSVTRGAS 


81 


SIEGPTLREWLTSRTPHS 


82 
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Table 7— G-CSF-mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 

99 


EEDCK . 

EEDCK 


99 


1 

EEDCK ■ — 


99 
100 


EEDoK . 

EEDoK 


100 


1 

EEDoK . 


100 
101 


pGluEDoK 

pGluEDoK 

1 

PGluEDoK 


101 

101 
102 


PicSDoK 

PicSDoK 


102 


1 

PicSDoK . 

EEDCK-A-EEDCK ■ 

EEDXK-A-EEDXK . 


102 
103 
104 
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Table 8— TNF-antagonist peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


YCFTASENHCY 


106 


YCFTNSENHCY 


107 


YCFTRSENHCY 


108 


FCASENHCY 


109 


YCASENHCY 


110 


FCNSENHCY 


111 


FCNSENRCY 


112 | 


FCNSVENRCY 


113 


YCSQSVSNDCF 


114 


FCVSNDRCY 


115 


YCRKELGQVCY 


116 


YCKEPGQCY 


117 




118 


FCRKEMGCY 


119 


YCWSQNLCY 


120 


YCELSQYLCY 


121 


YCWSQNYCY 


122 


YCWSQYLCY 


123 


DFLPHYKNTSLGHRP 


1085 


AA.-AB, 

\ 

AC 

/ 

AA.-AB, 


NR 





Hi 
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Table 9— Integrin-binding peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


RX^TXyVVX, 


441 


R^ETXjWX, 


442 


RGDGX 


443 


CRGDGXC 


444 


r.Y X RLDXJCC ! 


445 


CARRI HAPC 


446 


CPSRLDSPC .... 


447 


XXX RGDXXX- 


448 


CX ORGPOX r C 


449 


mORGDCFC 


450 


CDCRGDCLC 


451 


CLCRGDCIC 


452 


X X DDX X XJ( 


453 


X XJCDDX X XXX« 


454 


nWDDGWLC 


455 


nwnni wwlc 


456 ! 


CWDDGLMC 


457 


nwnnfiWMC 


458 


CSWDDGWLC 


459 


npnnLWWLC 


460 


KIGR 


NR 


Gfil 


NR 


RGD 


NR 


CGRECPRLCQSSC 


1071 


PNGRPA/SGOAGRC 


1072 




1073 


RGD 


NR 


NGR 


NR 


GSL 


NR 


NGRAHA 


1074 


ONGRO . .. . 


1075 


CDCRGDCFC 


1076 | 


CGSLVRC 


1077 


DLXXL 


1043 


RTDLDSLRTYTL 


1044 


RTDLDSLRTY 


1053 


RTDLDSLRT 


1054 


RTDLDSLR 


1078 


GDLDLLKLRLTL 


1079 


GDLHSLRQLLSR 


1080 


RDDLHMLRLQLW 


1081 


SSDLHALKKRYG 


1082 


RGDLKQLSELTW 


1083 


RGDLAALSAPPV 


1084 



Ml 
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Table 10— Selectin antagonist peptide sequences 



UCJ.1CC/ 9U UllUXC 


SEQ 
ID NO: 


DITWDQLWDLMK 


147 


Ul rVvDELWKIMN 


148 


UYlvvFELWDMMQ 


149 


QITWAQLWNMMK 


150 


DMTWHDLWTLMS 


151 


DYSWHDLWEMMS 


152 


EITWDQLWEVMN 


153 


HVSWEQLWDIMN 


154 


HITWDQLWRIMT 


155 


RNMSWLELWEHMK 


156 


AEWTWDQLWHVMNPAESQ 


157 


HRAEWLALWEQMSP 


158 


KKEDWLALWRIMSV 


159 


ITWDQLWDLMK 


160 


DITWDQLWDLMK 


161 


DITWDQLWDLMK 


162 


DiTwnni wni kaw ~ — 


163 


CQNRYTDLVAIQNKNE 


462 


AENWADNEPNNKRNNED 


463 


RKNNKTWTWVGTKKALTNE 


464 


KKALTNEAENWAD 


465 


CQXRYTDLVAIQNKXE 


466 


RKXNXXWTWVGTXKXLTEE 


467 


AENWADGEPNNKXNXED 


468 


CXXXYTXLVAIQNKXE 


469 


RKXXXXWXWVGTXKXLTXE 


470 


AXNWXXXEPNNXXXED 


471 


XKXKTXEAXNWXX 


472 



7r 
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Table 11-Antipathogenic peptide sequences 



Sequence/structure 



fiF FALIPKHSSPLH ™ l SAVGSALSSSGGQgL 
GFFALIPKIISSP' FKTLLSAVGSALSSSGGQE 



SEQ 
IP NO; 

503 
504 



riFF Al IPKI1SSPLFKTLLSAV 



rcc cAl tPKIISSPLFKTLLSAV 



Vftcc &i IPKIISSPLFKTLLSAV 



./i^f fai IPKIISSPLFKT LLSAV 



KKGFFALIPKIISSPLFKTLLSAV 



GFF ALIPKIIS 

nri fli;i u\n TTfil PALISWIKHNRQQ 



Oi G ™* *VI TTfiLPAUSWIKHKRQQ 



hl GAVLKVLTTP I PAUSWIKHKrtQQ 
^pAXj.iA/iTTfilPALISWIKH 



AVLKVLTTGLPAUSWIKR 
KLL LLLKLLLLK 



KLLLKLLLKLLK 
KLLLKLKLKLLK 
KKLLKLKLKLKK 
KLLLKLLLKLLK 
KLLLKLKLKLLK 
KLLLLK 



KLLLKLLK 



KLLLKLKLKLLK 
KLLLKLKLKLLK 



KLLLKLKLKLLK 
KAAAKAAAKAAK 
KVWKVVVKWK 
KVWKVKVKWK 
KVWKVKVKVK 
KVWKVKVKWK 
KLILKL 
KVLHLL 
LKLRLL 
KPLHLL 
KLILKLVR 
KVFHLLHL 



HKFRILKL 
KPFHILHL 



507 



508 



509 



510 
511 



512 
513 



514 
515 
516 



525 
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RIIVRIRI RliR 
nil vninuniin 


549 


RlftlPI RVRIIR 


550 


KIVIRIRIRI iR 
rMvinininuin 


551 


RIAVKWRI RFIK 
niMvi\vvnLnrir\ 


552 


i\iowr\Ln vniin 


553 


IflllftWI IIRVRR 
r\r\iCi vvLii n v nn 


554 


PI\/IPIPIRI IRIR 
nlVlnlnlnUnin 


555' 


RIIVRIRI RIIRVR 
nllvninLniinvn 


556 


DI/^IDi RWRIIPRV 

nlulnLnVnllnnV 


557 


LfiVIDlDAQI IRIDIR 
ftlVinlnAnLlnlnin 


558 


DIIWI/IDI DIIUVIDI 

nllVKInLnlli\l\lnL 


559 


l/IOII/ A DWDIIDWl/M 


SAO 


DMWUIDI DHUUIDI i 

nllVnlnLnlinnlriL 


561 


UifMI/AUWDItDWLIII 


562 


divvhiuii nvik'k'lRl 
Ml Y V IMnLn Y IIMMnL 


563 


KlunrSAnVnilnYftll j 


564 


DIVWI/DUDDVIl/l/IDI 

HIT VKrnrnYlftlMnL 


•5615 


l/Br* Lll/ A □ DLI 1 \ DVI/I 1 


566 


I/IWID1DIDI IDIDIDI/IW 

KIVIRInlHLInlnlnKIV 


567 


DIIWl/IDI DM1/I/IDI IkV < 

nllVKInLnllKKInLIKK 


568 


1/irMA/ISI DWDIIDWI/l/2Di □ 

KIGWKLnVHIInVKIunLn , 


56Q 


l/IWIDIDIDI IDIDtDI/IWLA/l/DID 

KlVlnlnlnLlnlninWVKVKHIn 




DCAWI/IDI DIII/I/IDI li/l/IDI^D\/ll/ 


571 




572 


DIVWI/DUDDVII/lflDI 

nlYVKrnrnYIKnlnL 


573 


l/Df2LII/ADDUIIDVI/N 


574 


MvlnlnlnLlnlnlnlxlV 


575 


DIIWL/IDI DIIUVIDI \KK 

HH V r\l n Lnl 1 JNlNin LI l\f\ 


576 


D IVWG1/1 0 1 VI l/I/l Ol 

ril Y V oMo 1 Y I fsj\\ n L 


577 


KIVIFTRIRLTSIRIRSIV 


578 ( 


KPIHKARPTIIRYKMI 


579 I 


cyclicCKGFFALIPKIISSPLFKTLLSAVC 


580 


CKKGFFALIPKIISSPLFKTLLSAVC 


581 


CKKKGFFALIPKIISSPLFKTLLSAVC 


582 


CyclicCRIVIRIRIRURIRC 


583 


CyclicCKPGHKARPHIIRYKIIC 


584 


CyclicCRFAVKIRLRIIKKIRLIKKIRKRVIKC 


585 


KLLLKLLL KLLKC 


586 | 


KLLLKLLLKLLK 


587 


KLLLKLKLKLLKC 


588 


KLLLKLLLKLLK 


589 



V 
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Table 12— VIP-mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


HSDAVFYDNYTR LRKQMAVKKYLN SILN 


590 


NIp HSDAVFYDNYTR LRKQMAVKKYLN SILN 


591 


x Y'y"x_ 


592 


Y SX LN 


593 ! 


NH CH CO KKYX5 NH CH CO X6 


594 


| | 

(P,H2)m Z (CH2)n 




KKYL 


595 


NSILN 


596 


KKYL 


597 


KKYA 


598 


AVKKYL 


599 


NSILN 


600 


KKYV 


601 


SILauN 


602 


KKYLNIe 


603 


NSYLN 


604 


NSIYN 


605 


KKYLPPNSILN 


606 | 


1 an KKYL 


607 


PlanKKYL 


608 


KYL 


NR 


KKYNIp 


609 


VKKYL 


610 


LNSILN 


611 


YLNSILN 


612 


KKYLN 


613 


KKYLNS 


! 614 


KKYLNSI 


615 I 


KKYLN SI L 


I 616 


KKYL 


i 617 


KKYDA 


618 


AVKKYL 


619 


NSILN 


! 620 


KKYV 


621 


SILauN 


622 


NSYLN 


623 


NSIYN 


624 


KKYLNIe 


625 


KKYLPPNSILN 


626 


KKYL 


627 


KKYDA 


" ~628- 


AVKKYL 


629 


NSILN 


630 


KKYV 


631 


SILauN 


632 
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Table 15— Mast cell antagonists/Mast cell protease inhibitor 



peptide sequences 



Sequence/structure 


QUA 

> ID NO: 


SGSGVI KRPI Pll PV/TD " 

s3\jo>J Vl_r\nrLriLr V 1 rl 


272 


j n»»LOgnrLrrLrLrrn 1 


273 


u «uo i U 1 UMLr OLrLrlr WIoo 


274 


GSGSYDTRALPSLPLHPMSS 




GSGSSGVTMYPKLPPHWSMA 


276 


GSGSSGVRMYPKLPPHWSMA 


277 


GSGSSSMRMVPTIPGSAKHG 


278 


RNR 


NR 


QT 


NR 


RQK 


NR 


NRQ 


NR 


RQK 


NR 


RNRQKT 


436 


RNRQ 


437 


RNRQK 


438 


NRQKT 


439 


RQKT 


440 
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Table 16-SH3 antagonist peptide seq uences 

SEQ 

Sequence/structure 



RPLPPLP 



RPLPPTP 



RPQPPPP 



RQLPIPP 
XXXRPLPPLPXP 
XXXRPLPPIPXX 
XXXRPLPPLPXX 
RXXRPLPPLPXP 



RXXRPLPPLPPP 
PPPYPPPP1PXX 
PPPYPPPPVPXX 
LXXRPLPX¥P~ 
VXXRPLPXLP 

ppxexpppyp 

+PPVPXKPXWL 



I P NO; 

282 



283 
284 
285 



293 



294 



304 



305 




306 
307 
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Table 17— Somatostatin or cortistatin mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


X1-X2-Asn-Phe-Phe-Trp-Lys-Thr-Phe-X3-Ser-X4 


473 


Asp Arg Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys Lys 


474 


Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys Lys 


475 


Cys Arg Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys Lys 


476 


Asp Arg Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


477 


Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


478 


Cys Arg Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


479 


Asp Arg Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


480 


Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys Lys 


481 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys Lys 


482 


Asp Arg Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


483 


Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


484 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


485 


Asp Arg Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


486 


Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


487 


Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


488 


Asp Arg Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


489 


Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


490 


Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


491 


Asp Arg Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


492 


Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


493 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


494 


Asp Arg Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


495 


Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


496 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


497 



to 
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Table 18— UKR antagonist peptide sequences 



Sequence/structure 


SEQ 


AEPMPHSLNFSQYLWYT 


\yb 


AEHTYSSLWDTYSPLAF 


i an 


AELDLWMRHYPLSFSNR 




AESSLWTRYAWPSMPSY 


iyy 


AEWHPGLSFGSYLWSKT 


200 


AEPALLNWSFFFN PGLH 


201 


AEWSFYNLHLPEPQTIF 


202 


AEPLDLWSLYSLPPLAM 


203 


AEPTLWQLYQFPLRLSG . 


204 


AEISFSELMWLRSTPAF 


205 


AELSEADLWTTWHjIVHjS _ — 


206 


AESSLWR 1 FS PS ALMMS . . 


207 


AESLPTLTSILWGKESV 


208 


AETLFMDLWHDKHILLT , _ 


209 


AEILNFPLWHEPLWSTE 


210 


AESQTGTLNTLFWNTLR 


211 


AFPVYGYELDSYLRSYY 


430 


AELDLSTFYDIQYLLRT 


431 


AEFFKLGPNGYVYLHSA _ 


432 


FKLXXXGYVYL , 


433 


AESTYHHLSLGYMYTLN 

YHXLXXGYMYT _ ■ 


~~ 434 
435 



WO 00/24782 



PCT/US99/25044 



Table 19— Macrophage and/or 



T-cell inhibiting peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


Xaa-Yaa-Arg 


NR 


Arg-Yaa-Xaa 


NR 


Xaa-Arg-Yaa 


NR 


Yaa-Arg-Xaa 


NR 


Ala-Arg 


NR 


Arg-Arg 


NR 


Asn-Arg 


NR 


Asp-Arg 


NR 


Cys-Arg 


NR 


Gln-Arg 


NR 


Glu-Arg 


NR 


Gly-Arg 


NR 


His-Arg 


NR 


Ile-Arg 


NR 


Leu-Arg 


NR 


Lys-Arg 


NR 


Met-Arg 


NR 


Phe-Arg 


NR 


Ser-Arg 


NR 


Thr-Arg 


NR 


Trp-Arg 


NR 


Tyr-Arg 


NR 


Val-Arg 


NR 


Ala-Glu-Arg 


NR 


Arg-Glu-Arg 


NR 


Asn-Glu-Arg 


NR 


Asp-Glu-Arg 


NR 


Cys-Glu-Arg 


NR 


Gln-Glu-Arg 


NR 


Glu-Glu-Arg 


NR 


Gly-Glu-Arg 


NR 


His-Glu-Arg 


NR 


Ile-Glu-Arg 


NR 


Leu-Glu-Arg 


NR 


Lys-Glu-Arg 


NR 


Met-Glu-Arg 


NR 


Phe-Glu-Arg 


NR ! 


Pro-Glu-Arg 


NR 


Ser-Glu-Arg - — 


- NR 


Thr-Glu-Arg 


NR 


Trp-Glu-Arg 


NR 


Tyr-Glu-Arg ] 


NR 


Val-Glu-Arg 


NR 

Arg-Met 
Arg-Phe 
Arg-Pro 
Arg-Ser 
Arg-Thr 

Arg-Trp 

Arg-Tyr 
Arg-Val 
Arg-Glu-Ala 
Arg-Glu-Asn 
Arg-Glu-Asp 
Arg-Glu-Cys 



NR 
NR 
NR 



NR 
NR 
NR 
NR 
NR 
NR 
NR 
NR 



NR 
NR 
NR 
NR 



Arg-Glu-Glu 



Arg-Glu-Gly 



Arg-Glu-His 



Arg-Glu-Ile 



Arg-Glu-Leu 



Arg-Glu-Lys 



Arg-Glu-Met 



NR 
NR 
NR 



NR 
NR 
NR 
NR 
NR 
NR 
NR 



Arg-Glu-Phe 



Arg-Glu-Pro 



Arg-Glu-Ser 



Arg-Glu-Thr 



Arg-Glu-Trp 



Arg-Glu-Tyr 



Arg-Glu-Val 



Arg-Arg-Glu 



Asn-Arg-Glu 



Gln-Arg-Glu 



Si* 



NR 
NR 
NR 

Phe-Arg-Glu 


NR 


Pro-Arg-Glu 


NR 


Ser-Arg-Glu 


NR 


Thr-Arg-Glu 


NR 


Trp-Arg-Glu 


NR 


Tyr-Arg-Glu 


NR 


Val-Arg-Glu 


NR 


Glu-Arg-Ala 


NR 


Glu-Arg-Arg 


NR 


Glu-Arg-Asn 


NR 


Glu-Arg-Asp 


NR 


Glu-Arg-Cys 


NR 


Glu-Arg-Gln 


NR 


Glu-Arg-Gly 


NR 


Glu-Arg-His 


NR 


Glu-Arg-Ile 


NR 


Glu-Arg-Leu 


NR 

Glu-Arg-Lys 


NR 


Glu-Arg-Met 


NR 


Glu-Arg-Phe 


NR 


Glu-Arg-Pro 


NR 


Glu-Arg-Ser 


NR 


Glu-Arg-Thr 


NR 


Glu-Arg-Trp 


NR 


GIu-Arg-Tyr 


NR 


Glu-Arg-Val 


NR 






Table 20— Additional Exemplary Pharmacologically Active Peptides 



Sequence/structure 


SEQ 
ID 
NO: 


Activity 


VEPNCDIHVMWEWECFERL 


1027 


VEGF-antagonist 


GERWCFDGPLTWVCGEES 


1084 


VEGF-antagonist 


RGWVEICVADDNGMCVTEAU 


1085 


VEGF-antagonist 


GWDECDVARMWEWECFAGV 


1086 


VEGF- antagonist 


GERWCFDGPRAWVCGWEI 


501 


VEGF- antagonist 


EELWCFDGPRAWVCGYVK 


502 


VEGF- antagonist 


RGWVEICAADDYGRCLTEAQ 


1031 


VEGF- antagonist 


RGWVEICESDVWGRCL 


1087 


VEGF- antagonist 


RGWVEICESDVWGRCL _I 


1088 


VEGF- antagonist 


GGNECDIARMWEWECFERL 


1089 


VEGF- antagonist 


RftWVFICAADDYGRCL 1 


1090 


VEGF-antagonist 


CTTHWGFTLC 


1028 


MMP inhibitor 


CLRSGXGC 


1091 


MMP inhibitor 


TYYHWfiFXXC 


1092 


MMP inhibitor 


OYPYP 


1093 


MMP inhibitor 




1094 


MMP inhibitor 


QTTWWftFTI 5? 


1095 


MMP inhibitor 


OoLnvvvjrvvvtw 


1096 


CTLA4-mimetic 


ncx/r^cri iFA\/f3\/f5RCi 


125 


CTLA4-mimetic 




126 


CTLA4-mimetic 


1 1 rcDMk* 


105 


Antiviral (HBV) 


" ir wnnWGHHRCTAGHMANLTSHASAI 


127 


C3b antagonist 


1CVVQDWGHHRCT . 


128 


C3b antagonist 


CVVQDWGHHAC 


129 


C3b antagonist 

\/inr»i ilin-hinriinn 


STGGFDDVYDWARGVSSALTTTLVATR 
STGGFDDVYDWARRVSSAL 1 1 ILVATR 


186 


Vinculin-binding 


SRGVNFSEWLYDMSAAMKEASNVFPSRRSR 


187 


Vinculin-binding 


SSQNWDMEAGVEDLTAAMLGLLSTIHSSSR 


188 


Vinculin-binding 


<i<5pqi YTOFLVNYESAATRIQDLLIASRPSR 


189 


Vinculin-binding 


SSTGW VDLLGALQRAADATRTSI PPSLQNSR 


190 


Vinculin-binding 


DVYTKKELIECARRVSEK 


191 


Vinculin-binding 


EKGSYYPGSGIAQFHIDYNNVS 


192 


C4BP-binding 


SG IAQFH IDYNN VSSAEGWHVN 


193 


C4BP-binding 


LVTVEKGSYYPGSGIAQFHIDYNNVSSAEGWHVN 


194 


C4BP-binding 


SGIAQFHIDYNNVS 
LLGRMK 


195 
279 
280 


C4BP-binding 
anti-HBV 
anti-HBV 


ALLGRMKG 

LDPAFR 

CXXRGDC 

RPLPPLP 


281 
322 

323 


anti-HBV 
inhibition of platelet 
aggregation 
Src antagonist 


PPVPPR 
XFXDXWXXLXX 


324 
325 


Src antagonist 

Anti-cancer 
(particularly for 






KACRRLFGPVDSEQLSRDCD 


1 

326 


sarcomas) 
p16-mimetic 


RERWNFDFVTETPLEGDFAW 


327 


p16-mimetic 


KRRQTSMTDFYHSKRRLIFS 


328 


p16-mimetic 


TSMTDFYHSKRRLIFSKRKP 
RRLIF 

KRRQTSATDFYHSKRRLIFSRQIKIWFQNRRMKWKK 


329 
330 


p16-mimetic 


KRRLIFSKRQIKIWFQNRRMKWKK 

Asn Gln Gly Arg His Phe Cys Gly Gly Ala Leu Ile His Ala 

Arg Phe Val Met Thr Ala Ala Ser Cys Phe Gln 


331 
332 
498 


p16-mimetic 
p16-mimetic 
CAP37 mimetic/LPS 
binding 


Arg His Phe Cys Gly Gly Ala Leu Ile His Ala Arg Phe Val 
Met Thr Ala Ala Ser Cys 


499 


CAP37 mimetic/LPS 
binding 


Gly Thr Arg Cys Gln Val Ala Gly Trp Gly Ser Gln Arg Ser 
Gly Gly Arg Leu Ser Arg Phe Pro Arg Phe Val Asn Val 


500 


CAP37 mimetic/LPS 
binding 


WHWRHRIPLQLAAGR 
LKTPRV 


1097 


carbohydrate (GD1 
alpha) mimetic 


NTLKTPRV 


1098 
1099 


β2GPI Ab binding 
β2GPI Ab binding 


NTLKTPRVGGC 


1100 


β2GPI Ab binding 


KDKATF 


1101 


β2GPI Ab binding 


KDKATFGCHD 
KDKATFGCHDGC 


1102 


β2GPI Ab binding 


TLRVYK 
ATLRVYKGG 


1103 
1104 


β2GPI Ab binding 
β2GPI Ab binding 


CATLRVYKGG 


1105 
1106 


β2GPI Ab binding 
β2GPI Ab binding 


INLKALAALAKKIL 


1107 


Membrane- 
transporting 


GWT 


NR 


Membrane- 
transporting 


GWTLNSAGYLLG 


1108 


Membrane- 
transporting 


GWTLNSAGYLLGKINLKALAALAKKIL 


1109 


Membrane- 
transporting 



The present invention is also particularly useful with peptides 
having activity in treatment of: 

• cancer, wherein the peptide is a VEGF-mimetic or a VEGF receptor 
antagonist, a HER2 agonist or antagonist, a CD20 antagonist and the 
like; 

• asthma, wherein the protein of interest is a CKR3 antagonist, an IL-5 
receptor antagonist, and the like; 

• thrombosis, wherein the protein of interest is a GPIIb antagonist, a 
GPIIIa antagonist, and the like; 

• autoimmune diseases and other conditions involving immune 
modulation, wherein the protein of interest is an IL-2 receptor 
antagonist, a CD40 agonist or antagonist, a CD40L agonist or 
antagonist, a thymopoietin mimetic and the like. 

This invention requires the presence of at least one vehicle 
(F1, F2) attached to a peptide through the N-terminus, C-terminus or a 
sidechain of one of the amino acid residues. Multiple vehicles may also 
be used; e.g., Fc's at each terminus or an Fc at a terminus and a PEG 
group at the other terminus or a sidechain. 

An Fc domain is the preferred vehicle. The Fc domain may be fused 
to the N or C termini of the peptides or at both the N and C termini. For 
the TPO-mimetic peptides, molecules having the Fc domain fused to the N 
terminus of the peptide portion of the molecule are more bioactive than 
other such fusions, so fusion to the N terminus is preferred. 

As noted above, Fc variants are suitable vehicles within the scope of 
this invention. A native Fc may be extensively modified to form an Fc 
variant, provided that binding to the salvage receptor is maintained; 
see, for example, WO 97/34631 and WO 96/32478. In such Fc variants, one 
may remove one or more sites of a native Fc that provide structural 
features or functional activity not required by the fusion molecules of 
this invention. One may remove these sites by, for example, substituting 
or deleting residues, inserting residues into the site, or truncating 
portions containing the site. The inserted or substituted residues may 
also be altered amino acids, such as peptidomimetics or D-amino acids. 
Fc variants may be desirable for a number of reasons, several of which 
are described below. Exemplary Fc variants include molecules and 
sequences in which: 

1. Sites involved in disulfide bond formation are removed. Such removal 
may avoid reaction with other cysteine-containing proteins present in 
the host cell used to produce the molecules of the invention. For this 
purpose, the cysteine-containing segment at the N-terminus may be 
truncated or cysteine residues may be deleted or substituted with other 
amino acids (e.g., alanyl, seryl). In particular, one may truncate the 
N-terminal 20-amino acid segment of SEQ ID NO: 2 or delete or 
substitute the cysteine residues at positions 7 and 10 of SEQ ID NO: 2. 
Even when cysteine residues are removed, the single chain Fc domains 
can still form a dimeric Fc domain that is held together non-covalently. 

2. A native Fc is modified to make it more compatible with a selected host 
cell. For example, one may remove the PA sequence near the N-terminus 
of a typical native Fc, which may be recognized by a digestive 
enzyme in E. coli such as proline iminopeptidase. One may also add an 
N-terminal methionine residue, especially when the molecule is 
expressed recombinantly in a bacterial cell such as E. coli. The Fc 
domain of SEQ ID NO: 2 (Figure 4) is one such Fc variant. 

3. A portion of the N-terminus of a native Fc is removed to prevent N- 
terminal heterogeneity when expressed in a selected host cell. For this 
purpose, one may delete any of the first 20 amino acid residues at the 
N-terminus, particularly those at positions 1, 2, 3, 4 and 5. 

4. One or more glycosylation sites are removed. Residues that are 
typically glycosylated (e.g., asparagine) may confer cytolytic response. 
Such residues may be deleted or substituted with unglycosylated 
residues (e.g., alanine). 

5. Sites involved in interaction with complement, such as the C1q binding 
site, are removed. For example, one may delete or substitute the EKK 
sequence of human IgG1. Complement recruitment may not be 
advantageous for the molecules of this invention and so may be 
avoided with such an Fc variant. 

6. Sites are removed that affect binding to Fc receptors other than a 
salvage receptor. A native Fc may have sites for interaction with 
certain white blood cells that are not required for the fusion molecules 
of the present invention and so may be removed. 

7. The ADCC site is removed. ADCC sites are known in the art; see, for 
example, Molec. Immunol. 29 (5): 633-9 (1992) with regard to ADCC 
sites in IgG1. These sites, as well, are not required for the fusion 
molecules of the present invention and so may be removed. 

8. When the native Fc is derived from a non-human antibody, the native 
Fc may be humanized. Typically, to humanize a native Fc, one will 
substitute selected residues in the non-human native Fc with residues 
that are normally found in human native Fc. Techniques for antibody 
humanization are well known in the art. 

Preferred Fc variants include the following. In SEQ ID NO: 2 
(Figure 4) the leucine at position 15 may be substituted with glutamate; the 
glutamate at position 99, with alanine; and the lysines at positions 101 and 
103, with alanines. In addition, one or more tyrosine residues can be 
replaced by phenylalanine residues. 
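
The point substitutions named in the preceding paragraph can be illustrated with a short sketch. It is illustrative only: the helper `apply_substitutions` and the toy sequence are hypothetical, and the actual sequence of SEQ ID NO: 2 (Figure 4) is not reproduced here. Positions are 1-based, as in the text, and each substitution is checked against the residue expected at that position before it is applied.

```python
def apply_substitutions(sequence, substitutions):
    """Return a copy of `sequence` with point substitutions applied.

    `substitutions` is a list of (position, expected_residue, new_residue)
    tuples using 1-based positions and one-letter amino acid codes.
    """
    residues = list(sequence)
    for position, expected, new in substitutions:
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        found = residues[position - 1]
        if found != expected:
            raise ValueError(
                f"expected {expected} at position {position}, found {found}"
            )
        residues[position - 1] = new
    return "".join(residues)


# Substitutions named in the text for SEQ ID NO: 2:
# Leu15 -> Glu, Glu99 -> Ala, Lys101 -> Ala, Lys103 -> Ala.
PREFERRED_FC_SUBSTITUTIONS = [
    (15, "L", "E"),
    (99, "E", "A"),
    (101, "K", "A"),
    (103, "K", "A"),
]

# Toy sequence (NOT SEQ ID NO: 2) with the expected residues placed at
# positions 15, 99, 101 and 103 so that the example can run.
toy = list("G" * 110)
for position, expected, _ in PREFERRED_FC_SUBSTITUTIONS:
    toy[position - 1] = expected
toy_sequence = "".join(toy)

variant = apply_substitutions(toy_sequence, PREFERRED_FC_SUBSTITUTIONS)
```

Checking the expected residue first guards against off-by-one numbering errors, which are easy to make when a sequence listing and a figure use different numbering.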

An alternative vehicle would be a protein, polypeptide, peptide, 
antibody, antibody fragment, or small molecule (e.g., a peptidomimetic 
compound) capable of binding to a salvage receptor. For example, one 
could use as a vehicle a polypeptide as described in U.S. Pat. No. 5,739,277, 
issued April 14, 1998 to Presta et al. Peptides could also be selected by 
phage display for binding to the FcRn salvage receptor. Such salvage 
receptor-binding compounds are also included within the meaning of 
"vehicle" and are within the scope of this invention. Such vehicles should 
be selected for increased half-life (e.g., by avoiding sequences recognized 
by proteases) and decreased immunogenicity (e.g., by favoring non- 
immunogenic sequences, as discovered in antibody humanization). 

As noted above, polymer vehicles may also be used for F1 and F2. 
Various means for attaching chemical moieties useful as vehicles are 
currently available; see, e.g., Patent Cooperation Treaty ("PCT") 
International Publication No. WO 96/11953, entitled "N-Terminally 
Chemically Modified Protein Compositions and Methods," herein 
incorporated by reference in its entirety. This PCT publication discloses, 
among other things, the selective attachment of water soluble polymers to 
the N-terminus of proteins. 

A preferred polymer vehicle is polyethylene glycol (PEG). The PEG 
group may be of any convenient molecular weight and may be linear or 
branched. The average molecular weight of the PEG will preferably range 
from about 2 kiloDalton ("kD") to about 100 kDa, more preferably from 
about 5 kDa to about 50 kDa, most preferably from about 5 kDa to about 
10 kDa. The PEG groups will generally be attached to the compounds of 
the invention via acylation or reductive alkylation through a reactive 
group on the PEG moiety (e.g., an aldehyde, amino, thiol, or ester group) 
to a reactive group on the inventive compound (e.g., an aldehyde, amino, 
or ester group). 
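
The molecular weight ranges stated above can be made concrete with a small sketch. The function names are hypothetical. The ranges in the text are approximate ("about"), so the hard cut-offs below are a simplification. The repeat-unit estimate assumes a linear PEG and ignores end groups, using roughly 44.05 Da per ethylene oxide unit.

```python
ETHYLENE_OXIDE_UNIT_DA = 44.05  # approximate mass of one -CH2-CH2-O- unit

# Approximate ranges from the text, in kDa, listed narrowest first so that
# the most specific label is returned.
PEG_RANGES_KDA = [
    ("most preferred", 5.0, 10.0),
    ("more preferred", 5.0, 50.0),
    ("preferred", 2.0, 100.0),
]


def peg_preference(average_mw_kda):
    """Classify an average PEG molecular weight against the stated ranges."""
    for label, low, high in PEG_RANGES_KDA:
        if low <= average_mw_kda <= high:
            return label
    return "outside stated ranges"


def approximate_repeat_units(average_mw_kda):
    """Estimate the number of ethylene oxide units, ignoring end groups."""
    return round(average_mw_kda * 1000.0 / ETHYLENE_OXIDE_UNIT_DA)
```

For instance, `peg_preference(20)` falls in the "more preferred" band, and a PEG of about 10 kDa corresponds to a little over two hundred repeat units under these assumptions.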

A useful strategy for the PEGylation of synthetic peptides consists 
of combining, through forming a conjugate linkage in solution, a peptide 
and a PEG moiety, each bearing a special functionality that is mutually 
reactive toward the other. The peptides can be easily prepared with 
conventional solid phase synthesis (see, for example, Figures 5 and 6 and 
the accompanying text herein). The peptides are "preactivated" with an 
appropriate functional group at a specific site. The precursors are purified 
and fully characterized prior to reacting with the PEG moiety. Ligation of 
the peptide with PEG usually takes place in aqueous phase and can be 
easily monitored by reverse phase analytical HPLC. The PEGylated 
peptides can be easily purified by preparative HPLC and characterized by 
analytical HPLC, amino acid analysis and laser desorption mass 
spectrometry. 

Polysaccharide polymers are another type of water soluble polymer 
which may be used for protein modification. Dextrans are polysaccharide 
polymers comprised of individual subunits of glucose predominantly 
linked by α1-6 linkages. The dextran itself is available in many molecular 
weight ranges, and is readily available in molecular weights from about 1 
kD to about 70 kD. Dextran is a suitable water soluble polymer for use in 
the present invention as a vehicle by itself or in combination with another 
vehicle (e.g., Fc). See, for example, WO 96/11953 and WO 96/05309. The 
use of dextran conjugated to therapeutic or diagnostic immunoglobulins 
has been reported; see, for example, European Patent Publication No. 0 
315 456, which is hereby incorporated by reference. Dextran of about 1 kD 
to about 20 kD is preferred when dextran is used as a vehicle in 
accordance with the present invention. 

Linkers. Any "linker" group is optional. When present, its chemical 
structure is not critical, since it serves primarily as a spacer. The linker is 
preferably made up of amino acids linked together by peptide bonds. 
Thus, in preferred embodiments, the linker is made up of from 1 to 20 
amino acids linked by peptide bonds, wherein the amino acids are selected 
from the 20 naturally occurring amino acids. Some of these amino acids 
may be glycosylated, as is well understood by those in the art. In a more 
preferred embodiment, the 1 to 20 amino acids are selected from glycine, 
alanine, proline, asparagine, glutamine, and lysine. Even more preferably, 
a linker is made up of a majority of amino acids that are sterically 
unhindered, such as glycine and alanine. Thus, preferred linkers are 
polyglycines (particularly (Gly)4, (Gly)5), poly(Gly-Ala), and polyalanines. 
Other specific examples of linkers are: 

(Gly)3Lys(Gly)4 (SEQ ID NO: 333); 
(Gly)3AsnGlySer(Gly)2 (SEQ ID NO: 334); 
(Gly)3Cys(Gly)4 (SEQ ID NO: 335); and 
GlyProAsnGlyGly (SEQ ID NO: 336). 
To explain the above nomenclature, for example, (Gly)3Lys(Gly)4 means 
Gly-Gly-Gly-Lys-Gly-Gly-Gly-Gly. Combinations of Gly and Ala are also 
preferred. The linkers shown here are exemplary; linkers within the scope 
of this invention may be much longer and may include other residues. 
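
The linker nomenclature explained above (a parenthesized residue followed by a repeat count, as in (Gly)3Lys(Gly)4) can be expanded mechanically. The following is a minimal sketch; the function names are hypothetical, and it assumes repeat counts are written as plain digits directly after the closing parenthesis.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Either "(Xaa)n" (a repeated residue) or a bare "Xaa" (a single residue).
TOKEN = re.compile(r"\(([A-Z][a-z]{2})\)(\d+)|([A-Z][a-z]{2})")


def expand_linker(shorthand):
    """Expand e.g. '(Gly)3Lys(Gly)4' into a list of three-letter residues."""
    residues = []
    pos = 0
    while pos < len(shorthand):
        match = TOKEN.match(shorthand, pos)
        if match is None:
            raise ValueError(f"cannot parse {shorthand!r} at offset {pos}")
        repeated, count, single = match.groups()
        name = repeated or single
        if name not in THREE_TO_ONE:
            raise ValueError(f"unknown residue {name!r}")
        residues.extend([name] * (int(count) if repeated else 1))
        pos = match.end()
    return residues


def to_one_letter(residues):
    """Convert a list of three-letter residues to a one-letter string."""
    return "".join(THREE_TO_ONE[name] for name in residues)


expanded = expand_linker("(Gly)3Lys(Gly)4")
hyphenated = "-".join(expanded)      # Gly-Gly-Gly-Lys-Gly-Gly-Gly-Gly
one_letter = to_one_letter(expanded)  # GGGKGGGG
```

Linkers written without repeat counts, such as GlyProAsnGlyGly, pass through the same parser one residue at a time.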

Non-peptide linkers are also possible. For example, alkyl linkers 
such as -NH-(CH2)s-C(O)-, wherein s = 2-20 could be used. These alkyl 
linkers may further be substituted by any non-sterically hindering group 
such as lower alkyl (e.g., C1-C6), lower acyl, halogen (e.g., Cl, Br), CN, NH2, 
phenyl, etc. An exemplary non-peptide linker is a PEG linker, 
VI 




wherein n is such that the linker has a molecular weight of 100 to 5000 kD, 
preferably 100 to 500 kD. The peptide linkers may be altered to form 
derivatives in the same manner as described above. 

Derivatives. The inventors also contemplate derivatizing the 
peptide and/or vehicle portion of the compounds. Such derivatives may 
improve the solubility, absorption, biological half life, and the like of the 
compounds. The moieties may alternatively eliminate or attenuate any 
undesirable side-effect of the compounds and the like. Exemplary 
derivatives include compounds in which: 

1. The compound or some portion thereof is cyclic. For example, the 
peptide portion may be modified to contain two or more Cys residues 
(e.g., in the linker), which could cyclize by disulfide bond formation. 
For citations to references on preparation of cyclized derivatives, see 
Table 2. 

2. The compound is cross-linked or is rendered capable of cross-linking 
between molecules. For example, the peptide portion may be modified 
to contain one Cys residue and thereby be able to form an 
intermolecular disulfide bond with a like molecule. The compound 
may also be cross-linked through its C-terminus, as in the molecule 
shown below. 

VII 

F 1 -(X 1 ) b -CO-N ^Y%H 2 

F 1 -(X 1 ) b -CO-B^YNH 

O 

3.

4. One or more peptidyl [-C(O)NR-] linkages (bonds) is replaced by a
non-peptidyl linkage. Exemplary non-peptidyl linkages are
-CH2-carbamate [-CH2-OC(O)NR-], phosphonate, -CH2-sulfonamide
[-CH2-S(O)2NR-], urea [-NHC(O)NH-], -CH2-secondary amine, and alkylated
peptide [-C(O)NR6- wherein R6 is lower alkyl].

5. The N-terminus is derivatized. Typically, the N-terminus may be
acylated or modified to a substituted amine. Exemplary N-terminal
derivative groups include -NRR1 (other than -NH2), -NRC(O)R1,
-NRC(O)OR1, -NRS(O)2R1, -NHC(O)NHR1, succinimide, or
benzyloxycarbonyl-NH- (CBZ-NH-), wherein R and R1 are each
independently hydrogen or lower alkyl and wherein the phenyl ring
may be substituted with 1 to 3 substituents selected from the group
consisting of C1-C4 alkyl, C1-C4 alkoxy, chloro, and bromo.

6. The free C-terminus is derivatized. Typically, the C-terminus is
esterified or amidated. For example, one may use methods described in
the art to add (NH-CH2-CH2-NH2)2 to compounds of this invention
having any of SEQ ID NOS: 504 to 508 at the C-terminus. Likewise,
one may use methods described in the art to add -NH2 to compounds
of this invention having any of SEQ ID NOS: 924 to 955, 963 to 972,
1005 to 1013, or 1018 to 1023 at the C-terminus. Exemplary C-terminal
derivative groups include, for example, -C(O)R2 wherein R2 is lower
alkoxy or -NR3R4 wherein R3 and R4 are independently hydrogen or
C1-C8 alkyl (preferably C1-C4 alkyl).

7. A disulfide bond is replaced with another, preferably more stable,
cross-linking moiety (e.g., an alkylene). See, e.g., Bhatnagar et al.
(1996), J. Med. Chem. 39: 3814-9; Alberts et al. (1993), Thirteenth Am.
Pep. Symp., 357-9.

8. One or more individual amino acid residues is modified. Various 
derivatizing agents are known to react specifically with selected 
sidechains or terminal residues, as described in detail below. 

Lysinyl residues and amino terminal residues may be reacted with
succinic or other carboxylic acid anhydrides, which reverse the charge of the
lysinyl residues. Other suitable reagents for derivatizing
alpha-amino-containing residues include imidoesters such as methyl
picolinimidate; pyridoxal phosphate; pyridoxal; chloroborohydride;
trinitrobenzenesulfonic acid; O-methylisourea; 2,4 pentanedione; and
transaminase-catalyzed reaction with glyoxylate.

Arginyl residues may be modified by reaction with any one or
combination of several conventional reagents, including phenylglyoxal,
2,3-butanedione, 1,2-cyclohexanedione, and ninhydrin. Derivatization of
arginyl residues requires that the reaction be performed in alkaline
conditions because of the high pKa of the guanidine functional group.
Furthermore, these reagents may react with the groups of lysine as well as
the arginine epsilon-amino group.

Specific modification of tyrosyl residues has been studied extensively,
with particular interest in introducing spectral labels into tyrosyl residues by
reaction with aromatic diazonium compounds or tetranitromethane. Most
commonly, N-acetylimidazole and tetranitromethane are used to form O-acetyl
tyrosyl species and 3-nitro derivatives, respectively.

Carboxyl sidechain groups (aspartyl or glutamyl) may be selectively
modified by reaction with carbodiimides (R'-N=C=N-R') such as
1-cyclohexyl-3-(2-morpholinyl-(4-ethyl) carbodiimide or
1-ethyl-3-(4-azonia-4,4-dimethylpentyl) carbodiimide. Furthermore,
aspartyl and glutamyl residues may be converted to asparaginyl and
glutaminyl residues by reaction with ammonium ions.

Glutaminyl and asparaginyl residues may be deamidated to the 
corresponding glutamyl and aspartyl residues. Alternatively, these residues 
are deamidated under mildly acidic conditions. Either form of these residues 
falls within the scope of this invention.

Cysteinyl residues can be replaced by amino acid residues or other
moieties either to eliminate disulfide bonding or, conversely, to stabilize
cross-linking. See, e.g., Bhatnagar et al. (1996), J. Med. Chem. 39: 3814-9.

Derivatization with bifunctional agents is useful for cross-linking the
peptides or their functional derivatives to a water-insoluble support matrix or
to other macromolecular vehicles. Commonly used cross-linking agents
include, e.g., 1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde,
N-hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid,
homobifunctional imidoesters, including disuccinimidyl esters such as
3,3'-dithiobis(succinimidylpropionate), and bifunctional maleimides such as
bis-N-maleimido-1,8-octane. Derivatizing agents such as
methyl-3-[(p-azidophenyl)dithio]propioimidate yield photoactivatable
intermediates that are capable of forming crosslinks in the presence of light.
Alternatively, reactive water-insoluble matrices such as cyanogen
bromide-activated carbohydrates and the reactive substrates described in
U.S. Pat. Nos. 3,969,287; 3,691,016; 4,195,128; 4,247,642; 4,229,537; and
4,330,440 are employed for protein immobilization.

Carbohydrate (oligosaccharide) groups may conveniently be
attached to sites that are known to be glycosylation sites in proteins.
Generally, O-linked oligosaccharides are attached to serine (Ser) or
threonine (Thr) residues while N-linked oligosaccharides are attached to
asparagine (Asn) residues when they are part of the sequence
Asn-X-Ser/Thr, where X can be any amino acid except proline. X is preferably
one of the 19 naturally occurring amino acids other than proline. The
structures of N-linked and O-linked oligosaccharides and the sugar
residues found in each type are different. One type of sugar that is
commonly found on both is N-acetylneuraminic acid (referred to as sialic
acid). Sialic acid is usually the terminal residue of both N-linked and
O-linked oligosaccharides and, by virtue of its negative charge, may confer
acidic properties to the glycosylated compound. Such site(s) may be
incorporated in the linker of the compounds of this invention and are
preferably glycosylated by a cell during recombinant production of the
polypeptide compounds (e.g., in mammalian cells such as CHO, BHK,
COS). However, such sites may further be glycosylated by synthetic or
semi-synthetic procedures known in the art.
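
The Asn-X-Ser/Thr rule stated above lends itself to a simple sequence scan. The following is a minimal sketch, not part of the original disclosure; it assumes one-letter amino acid codes, and the function name and the example linker sequence are hypothetical.

```python
import re

# Asn (N), then any residue except Pro (P), then Ser (S) or Thr (T).
# The lookahead keeps matches zero-width after the Asn, so overlapping
# sites such as "NNSS" are all reported.
_SEQUON = re.compile(r"N(?=[^P][ST])")


def find_n_glycosylation_sites(sequence: str) -> list[int]:
    """Return 1-based positions of Asn residues inside Asn-X-Ser/Thr motifs."""
    return [m.start() + 1 for m in _SEQUON.finditer(sequence.upper())]


# Hypothetical linker containing one candidate N-linked site.
print(find_n_glycosylation_sites("GGGNGSGG"))
```

A match only indicates a candidate site; whether it is actually glycosylated depends on the host cell and the surrounding structure.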

Other possible modifications include hydroxylation of proline and
lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues,
oxidation of the sulfur atom in Cys, methylation of the alpha-amino
groups of lysine, arginine, and histidine side chains. Creighton, Proteins:
Structure and Molecule Properties (W. H. Freeman & Co., San Francisco),
pp. 79-86 (1983).

Compounds of the present invention may be changed at the DNA
level, as well. The DNA sequence of any portion of the compound may be
changed to codons more compatible with the chosen host cell. For E. coli,
which is the preferred host cell, optimized codons are known in the art.
Codons may be substituted to eliminate restriction sites or to include silent
restriction sites, which may aid in processing of the DNA in the selected
host cell. The vehicle, linker and peptide DNA sequences may be modified
to include any of the foregoing sequence changes.
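
The codon changes described above can be illustrated with a short back-translation sketch. It is not part of the original disclosure: the one-codon-per-residue table below is an assumed, illustrative choice rather than an authoritative E. coli preference table, and the function names are hypothetical. The only restriction site used is the EcoRI recognition sequence GAATTC.

```python
# Illustrative codon choices, one per amino acid (assumed for this sketch;
# real preferences depend on the host strain and expression level).
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGT", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
}


def back_translate(peptide: str) -> str:
    """Encode a one-letter peptide sequence using the table above."""
    try:
        return "".join(PREFERRED_CODON[aa] for aa in peptide.upper())
    except KeyError as exc:
        raise ValueError(f"unknown residue: {exc.args[0]!r}") from None


def contains_site(dna: str, site: str) -> bool:
    """Report whether a recognition sequence occurs on the given strand."""
    return site.upper() in dna.upper()


linker_dna = back_translate("GGGKGGGG")
print(linker_dna, contains_site(linker_dna, "GAATTC"))
```

Because most amino acids have several synonymous codons, a site found by contains_site can usually be removed by swapping one codon for a synonym without changing the encoded peptide. The check scans only the strand supplied.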
Methods of Making 

The compounds of this invention largely may be made in
transformed host cells using recombinant DNA techniques. To do so, a
recombinant DNA molecule coding for the peptide is prepared. Methods
of preparing such DNA molecules are well known in the art. For instance,
sequences coding for the peptides could be excised from DNA using
suitable restriction enzymes. Alternatively, the DNA molecule could be
synthesized using chemical synthesis techniques, such as the
phosphoramidite method. Also, a combination of these techniques could
be used.

The invention also includes a vector capable of expressing the
peptides in an appropriate host. The vector comprises the DNA molecule
that codes for the peptides operatively linked to appropriate expression
control sequences. Methods of effecting this operative linking, either
before or after the DNA molecule is inserted into the vector, are well
known. Expression control sequences include promoters, activators,
enhancers, operators, ribosomal binding sites, start signals, stop signals,
cap signals, polyadenylation signals, and other signals involved with the
control of transcription or translation.

The resulting vector having the DNA molecule thereon is used to
transform an appropriate host. This transformation may be performed
using methods well known in the art.

Any of a large number of available and well-known host cells may
be used in the practice of this invention. The selection of a particular host
is dependent upon a number of factors recognized by the art. These
include, for example, compatibility with the chosen expression vector,
toxicity of the peptides encoded by the DNA molecule, rate of
transformation, ease of recovery of the peptides, expression characteristics,
bio-safety and costs. A balance of these factors must be struck with the
understanding that not all hosts may be equally effective for the
expression of a particular DNA sequence. Within these general guidelines,
useful microbial hosts include bacteria (such as E. coli sp.), yeast (such as
Saccharomyces sp.) and other fungi, insects, plants, mammalian (including
human) cells in culture, or other hosts known in the art.

Next, the transformed host is cultured and purified. Host cells may
be cultured under conventional fermentation conditions so that the
desired compounds are expressed. Such fermentation conditions are well
known in the art. Finally, the peptides are purified from culture by
methods well known in the art.

The compounds may also be made by synthetic methods. For
example, solid phase synthesis techniques may be used. Suitable
techniques are well known in the art, and include those described in
Merrifield (1973), Chem. Polypeptides, pp. 335-61 (Katsoyannis and
Panayotis eds.); Merrifield (1963), J. Am. Chem. Soc. 85: 2149; Davis et al.
(1985), Biochem. Intl. 10: 394-414; Stewart and Young (1969), Solid Phase
Peptide Synthesis; U.S. Pat. No. 3,941,763; Finn et al. (1976), The Proteins
(3rd ed.) 2: 105-253; and Erickson et al. (1976), The Proteins (3rd ed.) 2:
257-527. Solid phase synthesis is the preferred technique of making
individual peptides since it is the most cost-effective method of making
small peptides.

Compounds that contain derivatized peptides or which contain
non-peptide groups may be synthesized by well-known organic chemistry
techniques.

Uses of the Compounds

In general. The compounds of this invention have pharmacologic
activity resulting from their ability to bind to proteins of interest as
agonists, mimetics or antagonists of the native ligands of such proteins of
interest. The utility of specific compounds is shown in Table 2. The activity
of these compounds can be measured by assays known in the art. For the
TPO-mimetic and EPO-mimetic compounds, in vivo assays are further
described in the Examples section herein.

In addition to therapeutic uses, the compounds of the present
invention are useful in diagnosing diseases characterized by dysfunction
of their associated protein of interest. In one embodiment, a method of
detecting in a biological sample a protein of interest (e.g., a receptor) that
is capable of being activated comprises the steps of: (a) contacting the
sample with a compound of this invention; and (b) detecting activation of
the protein of interest by the compound. The biological samples include
tissue specimens, intact cells, or extracts thereof. The compounds of this
invention may be used as part of a diagnostic kit to detect the presence of
their associated proteins of interest in a biological sample. Such kits
employ the compounds of the invention having an attached label to allow
for detection. The compounds are useful for identifying normal or
abnormal proteins of interest. For the EPO-mimetic compounds, for
example, presence of abnormal protein of interest in a biological sample
may be indicative of such disorders as Diamond Blackfan anemia, where it
is believed that the EPO receptor is dysfunctional.

Therapeutic uses of EPO-mimetic compounds. The EPO-mimetic
compounds of the invention are useful for treating disorders characterized
by low red blood cell levels. Included in the invention are methods of
modulating the endogenous activity of an EPO receptor in a mammal,
preferably methods of increasing the activity of an EPO receptor. In
general, any condition treatable by erythropoietin, such as anemia, may
also be treated by the EPO-mimetic compounds of the invention. These
compounds are administered in an amount and by a route of delivery
appropriate for the nature and severity of the condition being treated, as
may be ascertained by one skilled in the art. Preferably, administration is
by injection, either subcutaneous, intramuscular, or intravenous.

Therapeutic uses of TPO-mimetic compounds. For the TPO-mimetic
compounds, one can utilize such standard assays as those
described in WO95/26746 entitled "Compositions and Methods for
Stimulating Megakaryocyte Growth and Differentiation". In vivo assays
also appear in the Examples hereinafter.

The conditions to be treated are generally those that involve an
existing megakaryocyte/platelet deficiency or an expected
megakaryocyte/platelet deficiency (e.g., because of planned surgery or
platelet donation). Such conditions will usually be the result of a
deficiency (temporary or permanent) of active Mpl ligand in vivo. The
generic term for platelet deficiency is thrombocytopenia, and hence the
methods and compositions of the present invention are generally available
for treating thrombocytopenia in patients in need thereof.

Thrombocytopenia (platelet deficiencies) may be present for
various reasons, including chemotherapy and other therapy with a variety
of drugs, radiation therapy, surgery, accidental blood loss, and other
specific disease conditions. Exemplary specific disease conditions that
involve thrombocytopenia and may be treated in accordance with this
invention are: aplastic anemia, idiopathic thrombocytopenia, metastatic
tumors which result in thrombocytopenia, systemic lupus erythematosus,
splenomegaly, Fanconi's syndrome, vitamin B12 deficiency, folic acid
deficiency, May-Hegglin anomaly, Wiskott-Aldrich syndrome, and
paroxysmal nocturnal hemoglobinuria. Also, certain treatments for AIDS
result in thrombocytopenia (e.g., AZT). Certain wound healing disorders
might also benefit from an increase in platelet numbers.

With regard to anticipated platelet deficiencies, e.g., due to future 
surgery, a compound of the present invention could be administered 
several days to several hours prior to the need for platelets. With regard
to acute situations, e.g., accidental and massive blood loss, a compound of 
this invention could be administered along with blood or purified 
platelets. 

The TPO-mimetic compounds of this invention may also be useful in 
stimulating certain cell types other than megakaryocytes if such cells are found
to express Mpl receptor. Conditions associated with such cells that express the 
Mpl receptor, which are responsive to stimulation by the Mpl ligand, are also 
within the scope of this invention. 

The TPO-mimetic compounds of this invention may be used in any
situation in which production of platelets or platelet precursor cells is desired,
or in which stimulation of the c-Mpl receptor is desired. Thus, for example, the
compounds of this invention may be used to treat any condition in a mammal
wherein there is a need of platelets, megakaryocytes, and the like. Such
conditions are described in detail in the following exemplary sources:
WO95/26746; WO95/21919; WO95/18858; WO95/21920, which are incorporated
herein.

The TPO-mimetic compounds of this invention may also be useful in 
maintaining the viability or storage life of platelets and/or megakaryocytes and 
related cells. Accordingly, it could be useful to include an effective amount of 
one or more such compounds in a composition containing such cells.

The therapeutic methods, compositions and compounds of the
present invention may also be employed, alone or in combination with
other cytokines, soluble Mpl receptor, hematopoietic factors, interleukins,
growth factors or antibodies in the treatment of disease states
characterized by other symptoms as well as platelet deficiencies. It is
anticipated that the inventive compound will prove useful in treating
some forms of thrombocytopenia in combination with general stimulators
of hematopoiesis, such as IL-3 or GM-CSF. Other megakaryocyte
stimulatory factors, i.e., meg-CSF, stem cell factor (SCF), leukemia
inhibitory factor (LIF), oncostatin M (OSM), or other molecules with
megakaryocyte stimulating activity may also be employed with Mpl
ligand. Additional exemplary cytokines or hematopoietic factors for such
co-administration include IL-1 alpha, IL-1 beta, IL-2, IL-3, IL-4, IL-5, IL-6,
IL-11, colony stimulating factor-1 (CSF-1), SCF, GM-CSF, granulocyte
colony stimulating factor (G-CSF), EPO, interferon-alpha (IFN-alpha),
consensus interferon, IFN-beta, or IFN-gamma. It may further be useful to
administer, either simultaneously or sequentially, an effective amount of a
soluble mammalian Mpl receptor, which appears to have an effect of
causing megakaryocytes to fragment into platelets once the
megakaryocytes have reached mature form. Thus, administration of an
inventive compound (to enhance the number of mature megakaryocytes)
followed by administration of the soluble Mpl receptor (to inactivate the
ligand and allow the mature megakaryocytes to produce platelets) is
expected to be a particularly effective means of stimulating platelet
production. The dosage recited above would be adjusted to compensate
for such additional components in the therapeutic composition. Progress
of the treated patient can be monitored by conventional methods.

In cases where the inventive compounds are added to compositions
of platelets and/or megakaryocytes and related cells, the amount to be
included will generally be ascertained experimentally by techniques and
assays known in the art. An exemplary range of amounts is 0.1 µg to 1 mg
inventive compound per 10^6 cells.
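
The exemplary range above scales linearly with cell number. The following is a minimal arithmetic sketch, not part of the original disclosure; the function name is hypothetical and the cell count is an assumed example. It says nothing about which amount within the range is appropriate, which the text leaves to experiment.

```python
# Exemplary bounds from the text, expressed in micrograms per 10^6 cells.
LOW_UG_PER_MILLION_CELLS = 0.1
HIGH_UG_PER_MILLION_CELLS = 1000.0  # 1 mg


def compound_amount_range_ug(cell_count: float) -> tuple[float, float]:
    """Scale the exemplary per-10^6-cell range to a given number of cells."""
    if cell_count < 0:
        raise ValueError("cell_count must be non-negative")
    millions = cell_count / 1e6
    return (LOW_UG_PER_MILLION_CELLS * millions,
            HIGH_UG_PER_MILLION_CELLS * millions)


# Assumed example: a composition containing 5 x 10^6 cells.
low_ug, high_ug = compound_amount_range_ug(5e6)
print(f"{low_ug:g} ug to {high_ug:g} ug")
```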

Pharmaceutical Compositions

In General. The present invention also provides methods of using
pharmaceutical compositions of the inventive compounds. Such
pharmaceutical compositions may be for administration by injection, or for
oral, pulmonary, nasal, transdermal or other forms of administration. In
general, the invention encompasses pharmaceutical compositions comprising
effective amounts of a compound of the invention together with
pharmaceutically acceptable diluents, preservatives, solubilizers, emulsifiers,
adjuvants and/or carriers. Such compositions include diluents of various
buffer content (e.g., Tris-HCl, acetate, phosphate), pH and ionic strength;
additives such as detergents and solubilizing agents (e.g., Tween 80,
Polysorbate 80), antioxidants (e.g., ascorbic acid, sodium metabisulfite),
preservatives (e.g., thimerosal, benzyl alcohol) and bulking substances (e.g.,
lactose, mannitol); incorporation of the material into particulate preparations of
polymeric compounds such as polylactic acid, polyglycolic acid, etc. or into
liposomes. Hyaluronic acid may also be used, and this may have the effect of
promoting sustained duration in the circulation. Such compositions may
influence the physical state, stability, rate of in vivo release, and rate of in vivo
clearance of the present proteins and derivatives. See, e.g., Remington's
Pharmaceutical Sciences, 18th Ed. (1990, Mack Publishing Co., Easton, PA
18042) pages 1435-1712, which are herein incorporated by reference. The
compositions may be prepared in liquid form, or may be in dried powder, such
as lyophilized form. Implantable sustained release formulations are also
contemplated, as are transdermal formulations.

Oral dosage forms. Contemplated for use herein are oral solid
dosage forms, which are described generally in Chapter 89 of Remington's
Pharmaceutical Sciences (1990), 18th Ed., Mack Publishing Co., Easton, PA
18042, which is herein incorporated by reference. Solid dosage forms
include tablets, capsules, pills, troches or lozenges, cachets or pellets. Also,
liposomal or proteinoid encapsulation may be used to formulate the
present compositions (as, for example, proteinoid microspheres reported
in U.S. Patent No. 4,925,673). Liposomal encapsulation may be used and
the liposomes may be derivatized with various polymers (e.g., U.S. Patent
No. 5,013,556). A description of possible solid dosage forms for the
therapeutic is given in Chapter 10 of Marshall, K., Modern Pharmaceutics
(1979), edited by G. S. Banker and C. T. Rhodes, herein incorporated by
reference. In general, the formulation will include the inventive
compound, and inert ingredients which allow for protection against the
stomach environment, and release of the biologically active material in the
intestine.
Also specifically contemplated are oral dosage forms of the above 
inventive compounds. If necessary, the compounds may be chemically 
modified so that oral delivery is efficacious. Generally, the chemical 
modification contemplated is the attachment of at least one moiety to the 
compound molecule itself, where said moiety permits (a) inhibition of 
proteolysis; and (b) uptake into the blood stream from the stomach or 
intestine. Also desired is the increase in overall stability of the compound 
and increase in circulation time in the body. Moieties useful as covalently 
attached vehicles in this invention may also be used for this purpose. 
Examples of such moieties include: PEG, copolymers of ethylene glycol 
and propylene glycol, carboxymethyl cellulose, dextran, polyvinyl alcohol, 
polyvinyl pyrrolidone and polyproline. See, for example, Abuchowski and
Davis, Soluble Polymer-Enzyme Adducts, Enzymes as Drugs (1981),
Hocenberg and Roberts, eds., Wiley-Interscience, New York, NY, pp.
367-83; Newmark et al. (1982), J. Appl. Biochem. 4: 185-9. Other polymers
that could be used are poly-1,3-dioxolane and poly-1,3,6-tioxocane. Preferred
for pharmaceutical usage, as indicated above, are PEG moieties.

For oral delivery dosage forms, it is also possible to use a salt of a
modified aliphatic amino acid, such as sodium N-(8-[2-hydroxybenzoyl]
amino) caprylate (SNAC), as a carrier to enhance absorption of the
therapeutic compounds of this invention. The clinical efficacy of a heparin
formulation using SNAC has been demonstrated in a Phase II trial
conducted by Emisphere Technologies. See US Patent No. 5,792,451, "Oral
drug delivery composition and methods".

The compounds of this invention can be included in the
formulation as fine multiparticulates in the form of granules or pellets of
particle size about 1 mm. The formulation of the material for capsule
administration could also be as a powder, lightly compressed plugs or
even as tablets. The therapeutic could be prepared by compression.

Colorants and flavoring agents may all be included. For example,
the protein (or derivative) may be formulated (such as by liposome or
microsphere encapsulation) and then further contained within an edible
product, such as a refrigerated beverage containing colorants and
flavoring agents.

One may dilute or increase the volume of the compound of the
invention with an inert material. These diluents could include
carbohydrates, especially mannitol, α-lactose, anhydrous lactose, cellulose,
sucrose, modified dextrans and starch. Certain inorganic salts may also be
used as fillers including calcium triphosphate, magnesium carbonate and
sodium chloride. Some commercially available diluents are Fast-Flo,
Emdex, STA-Rx 1500, Emcompress and Avicell.

Disintegrants may be included in the formulation of the therapeutic
into a solid dosage form. Materials used as disintegrants include but are
not limited to starch including the commercial disintegrant based on
starch, Explotab. Sodium starch glycolate, Amberlite, sodium
carboxymethylcellulose, ultramylopectin, sodium alginate, gelatin, orange
peel, acid carboxymethyl cellulose, natural sponge and bentonite may all
be used. Another form of the disintegrants are the insoluble cationic
exchange resins. Powdered gums may be used as disintegrants and as
binders and these can include powdered gums such as agar, Karaya or
tragacanth. Alginic acid and its sodium salt are also useful as
disintegrants.

Binders may be used to hold the therapeutic agent together to form
a hard tablet and include materials from natural products such as acacia,
tragacanth, starch and gelatin. Others include methyl cellulose (MC), ethyl
cellulose (EC) and carboxymethyl cellulose (CMC). Polyvinyl pyrrolidone
(PVP) and hydroxypropylmethyl cellulose (HPMC) could both be used in
alcoholic solutions to granulate the therapeutic.

An antifrictional agent may be included in the formulation of the
therapeutic to prevent sticking during the formulation process. Lubricants
may be used as a layer between the therapeutic and the die wall, and these
can include but are not limited to: stearic acid including its magnesium
and calcium salts, polytetrafluoroethylene (PTFE), liquid paraffin,
vegetable oils and waxes. Soluble lubricants may also be used such as
sodium lauryl sulfate, magnesium lauryl sulfate, polyethylene glycol of
various molecular weights, Carbowax 4000 and 6000.

Glidants that might improve the flow properties of the drug during
formulation and aid rearrangement during compression might be
added. The glidants may include starch, talc, pyrogenic silica and
hydrated silicoaluminate.

To aid dissolution of the compound of this invention into the
aqueous environment a surfactant might be added as a wetting agent.
Surfactants may include anionic detergents such as sodium lauryl sulfate,
dioctyl sodium sulfosuccinate and dioctyl sodium sulfonate. Cationic
detergents might be used and could include benzalkonium chloride or
benzethonium chloride. Potential nonionic detergents that
could be included in the formulation as surfactants are lauromacrogol 400,
polyoxyl 40 stearate, polyoxyethylene hydrogenated castor oil 10, 50 and
60, glycerol monostearate, polysorbate 40, 60, 65 and 80, sucrose fatty acid
ester, methyl cellulose and carboxymethyl cellulose. These surfactants
could be present in the formulation of the protein or derivative either
alone or as a mixture in different ratios.

Additives may also be included in the formulation to enhance
uptake of the compound. Additives potentially having this property are
for instance the fatty acids oleic acid, linoleic acid and linolenic acid.

Controlled release formulation may be desirable. The compound of
this invention could be incorporated into an inert matrix which permits
release by either diffusion or leaching mechanisms, e.g., gums. Slowly
degenerating matrices may also be incorporated into the formulation, e.g.,
alginates, polysaccharides. Another form of a controlled release of the
compounds of this invention is by a method based on the Oros therapeutic
system (Alza Corp.), i.e., the drug is enclosed in a semipermeable
membrane which allows water to enter and push drug out through a
single small opening due to osmotic effects. Some enteric coatings also
have a delayed release effect.

Other coatings may be used for the formulation. These include a
variety of sugars which could be applied in a coating pan. The therapeutic
agent could also be given in a film coated tablet and the materials used in
this instance are divided into 2 groups. The first are the nonenteric
materials and include methyl cellulose, ethyl cellulose, hydroxyethyl
cellulose, methylhydroxy-ethyl cellulose, hydroxypropyl cellulose,
hydroxypropyl-methyl cellulose, sodium carboxy-methyl cellulose,
povidone and the polyethylene glycols. The second group consists of the
enteric materials that are commonly esters of phthalic acid.

A mix of materials might be used to provide the optimum film
coating. Film coating may be carried out in a pan coater or in a fluidized
bed or by compression coating.

Pulmonary delivery forms. Also contemplated herein is pulmonary
delivery of the present protein (or derivatives thereof). The protein (or
derivative) is delivered to the lungs of a mammal while inhaling and
traverses across the lung epithelial lining to the blood stream. (Other
reports of this include Adjei et al., Pharma. Res. (1990) 7: 565-9; Adjei et al.
(1990), Internatl. J. Pharmaceutics 63: 135-44 (leuprolide acetate); Braquet
et al. (1989), J. Cardiovasc. Pharmacol. 13 (suppl. 5): s.143-146
(endothelin-1); Hubbard et al. (1989), Annals Int. Med. 3: 206-12
(α1-antitrypsin); Smith et al. (1989), J. Clin. Invest. 84: 1145-6
(α1-proteinase); Oswein et al. (March 1990), "Aerosolization of Proteins",
Proc. Symp. Resp. Drug Delivery II, Keystone, Colorado (recombinant human
growth hormone); Debs et al. (1988), J. Immunol. 140: 3482-8 (interferon-γ
and tumor necrosis factor α) and Platz et al., U.S. Patent No. 5,284,656
(granulocyte colony stimulating factor).)

Contemplated for use in the practice of this invention are a wide
range of mechanical devices designed for pulmonary delivery of
therapeutic products, including but not limited to nebulizers, metered
dose inhalers, and powder inhalers, all of which are familiar to those
skilled in the art. Some specific examples of commercially available
devices suitable for the practice of this invention are the Ultravent
nebulizer, manufactured by Mallinckrodt, Inc., St. Louis, Missouri; the
Acorn II nebulizer, manufactured by Marquest Medical Products,
Englewood, Colorado; the Ventolin metered dose inhaler, manufactured
by Glaxo Inc., Research Triangle Park, North Carolina; and the Spinhaler
powder inhaler, manufactured by Fisons Corp., Bedford, Massachusetts.

All such devices require the use of formulations suitable for the
dispensing of the inventive compound. Typically, each formulation is
specific to the type of device employed and may involve the use of an
appropriate propellant material, in addition to diluents, adjuvants
and/or carriers useful in therapy.

The inventive compound should most advantageously be
prepared in particulate form with an average particle size of less than 10
µm (or microns), most preferably 0.5 to 5 µm, for most effective delivery
to the distal lung.

1 0 Pharmaceutical^ acceptable carriers include carbohydrates such 

as trehalose, mannitol, xylitol, sucrose, lactose, and sorbitol. Other 
ingredients for use in formulations may include DPPC, DOPE, DSPC and 
DOPC. Natural or synthetic surfactants may be used. PEG may be used 
(even apart from its use in derivatizing the protein or analog). Dextrans, 

1 5 such as cyclodextran, may be used. Bile salts and other related enhancers 
may be used. Cellulose and cellulose derivatives may be used. Amino 
adds may be used, such as use in a buffer formulation. 

Also, the use of liposomes, microcapsules or microspheres, 
inclusion complexes, or other types of carriers is contemplated. 

Formulations suitable for use with a nebulizer, either jet or
ultrasonic, will typically comprise the inventive compound dissolved in
water at a concentration of about 0.1 to 25 mg of biologically active protein
per mL of solution. The formulation may also include a buffer and a
simple sugar (e.g., for protein stabilization and regulation of osmotic
pressure). The nebulizer formulation may also contain a surfactant, to
reduce or prevent surface induced aggregation of the protein caused by
atomization of the solution in forming the aerosol.

Formulations for use with a metered-dose inhaler device will
generally comprise a finely divided powder containing the inventive
compound suspended in a propellant with the aid of a surfactant. The
propellant may be any conventional material employed for this purpose,
such as a chlorofluorocarbon, a hydrochlorofluorocarbon, a
hydrofluorocarbon, or a hydrocarbon, including trichlorofluoromethane,
dichlorodifluoromethane, dichlorotetrafluoroethanol, and
1,1,1,2-tetrafluoroethane, or combinations thereof. Suitable surfactants include
sorbitan trioleate and soya lecithin. Oleic acid may also be useful as a
surfactant.

Formulations for dispensing from a powder inhaler device will
comprise a finely divided dry powder containing the inventive compound
and may also include a bulking agent, such as lactose, sorbitol, sucrose,
mannitol, trehalose, or xylitol in amounts which facilitate dispersal of the
powder from the device, e.g., 50 to 90% by weight of the formulation.

Nasal delivery forms. Nasal delivery of the inventive compound is
also contemplated. Nasal delivery allows the passage of the protein to the
blood stream directly after administering the therapeutic product to the
nose, without the necessity for deposition of the product in the lung.
Formulations for nasal delivery include those with dextran or
cyclodextran. Delivery via transport across other mucous membranes is
also contemplated.

Dosages. The dosage regimen involved in a method for treating the
above-described conditions will be determined by the attending physician,
considering various factors which modify the action of drugs, e.g. the age,
condition, body weight, sex and diet of the patient, the severity of any infection,
time of administration and other clinical factors. Generally, the daily regimen
should be in the range of 0.1-1000 micrograms of the inventive compound per
kilogram of body weight, preferably 0.1-150 micrograms per kilogram.

Specific preferred embodiments

The inventors have determined preferred peptide sequences for
molecules having many different kinds of activity. The inventors have
further determined preferred structures of these preferred peptides
combined with preferred linkers and vehicles. Preferred structures for
these preferred peptides are listed in Table 21 below.



Table 21— Preferred embodiments 



Sequence/structure                                        SEQ ID NO:  Activity

F1-(G)5-IEGPTLRQWLAARA-(G)8-IEGPTLRQWLAARA                337         TPO-mimetic
IEGPTLRQWLAARA-(G)8-IEGPTLRQWLAARA-(G)5-F1                338         TPO-mimetic
F1-(G)5-IEGPTLRQWLAARA                                    1032        TPO-mimetic
IEGPTLRQWLAARA-(G)5-F1                                    1033        TPO-mimetic
F1-(G)5-GGTYSCHFGPLTWVCKPQGG-(G)4-GGTYSCHFGPLTWVCKPQGG    339         EPO-mimetic
GGTYSCHFGPLTWVCKPQGG-(G)4-GGTYSCHFGPLTWVCKPQGG-(G)5-F1    340         EPO-mimetic
GGTYSCHFGPLTWVCKPQGG-(G)5-F1                              1034        EPO-mimetic
F1-(G)5-DFLPHYKNTSLGHRP                                   1045        TNF-α inhibitor
DFLPHYKNTSLGHRP-(G)5-F1                                   1046        TNF-α inhibitor
F1-(G)5-FEWTPGYWQPYALPL                                   1047        IL-1 R antagonist
FEWTPGYWQPYALPL-(G)5-F1                                   1048        IL-1 R antagonist
F1-(G)6-VEPNCDIHVMWEWECFERL                               1049        VEGF-antagonist
VEPNCDIHVMWEWECFERL-(G)5-F1                               1050        VEGF-antagonist
F1-(G)5-CTTHWGFTLC                                        1051        MMP inhibitor
CTTHWGFTLC-(G)8-F1                                        1052        MMP inhibitor

"F1" is an Fc domain as defined previously herein.



Working examples

The compounds described above may be prepared as described
below. These examples comprise preferred embodiments of the invention
and are illustrative rather than limiting.

Example 1 
TPO-Mimetics 

The following example uses peptides identified by the numbers 
appearing in Table A hereinafter. 

Preparation of peptide 19 . Peptide 17b (12 mg) and MeO-PEG-SH 
5000 (30 mg, 2 equiv.) were dissolved in 1 ml aqueous buffer (pH 8). The 
mixture was incubated at RT for about 30 minutes and the reaction was 
checked by analytical HPLC, which showed a > 80% completion of the 
reaction. The pegylated material was isolated by preparative HPLC. 

Preparation of peptide 20 . Peptide 18 (14 mg) and MeO-PEG- 
maleimide (25 mg) were dissolved in about 1.5 ml aqueous buffer (pH 8). 
The mixture was incubated at RT for about 30 minutes, at which time 
about 70% transformation was complete as monitored with analytical 
HPLC by applying an aliquot of sample to the HPLC column. The 
pegylated material was purified by preparative HPLC. 

Bioactivity assay. The TPO in vitro bioassay is a mitogenic assay
utilizing an IL-3 dependent clone of murine 32D cells that have been
transfected with human mpl receptor. This assay is described in greater
detail in WO 95/26746. Cells are maintained in MEM medium containing
10% Fetal Clone II and 1 ng/ml mIL-3. Prior to sample addition, cells are
prepared by rinsing twice with growth medium lacking mIL-3. An
extended twelve point TPO standard curve is prepared, ranging from 33
to 39 pg/ml. Four dilutions, estimated to fall within the linear portion of
the standard curve (100 to 125 pg/ml), are prepared for each sample and
run in triplicate. A volume of 100 µl of each dilution of sample or
standard is added to appropriate wells of a 96 well microtiter plate
containing 10,000 cells/well. After forty-four hours at 37 °C and 10% CO2,
MTS (a tetrazolium compound which is bioreduced by cells to a formazan)
is added to each well. Approximately six hours later, the optical density is
read on a plate reader at 490 nm. A dose response curve (log TPO
concentration vs. O.D. minus background) is generated and linear regression
analysis of points which fall in the linear portion of the standard curve is
performed. Concentrations of unknown test samples are determined
using the resulting linear equation and a correction for the dilution factor.
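
The back-calculation described in the preceding paragraph can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the standard concentrations and optical densities are invented placeholder values, not assay data from this specification, and the function names `fit_line` and `sample_concentration` are hypothetical. The sketch fits net O.D. against log10 concentration by ordinary least squares, inverts the fitted line for a test sample, and multiplies by the dilution factor.

```python
import math

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def sample_concentration(od, background, slope, intercept, dilution_factor):
    """Invert the standard curve and correct for the sample dilution."""
    log_conc = ((od - background) - intercept) / slope
    return (10 ** log_conc) * dilution_factor

# Hypothetical standards from the linear part of a curve (NOT data from the text).
standards_pg_per_ml = [50.0, 100.0, 200.0, 400.0]
standards_net_od = [0.20, 0.40, 0.60, 0.80]  # O.D. minus background

slope, intercept = fit_line(
    [math.log10(c) for c in standards_pg_per_ml], standards_net_od
)

# Hypothetical test sample: raw O.D. 0.45, background 0.05, diluted 1:10.
estimate = sample_concentration(0.45, 0.05, slope, intercept, dilution_factor=10)
print(f"estimated concentration: {estimate:.1f} pg/ml")
```

In practice the triplicate readings for each dilution would be averaged before inversion, and only dilutions that land inside the fitted linear range would be used.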

TMP tandem repeats with polyglycine linkers. Our design of
sequentially linked TMP repeats was based on the assumption that a
dimeric form of TMP was required for its effective interaction with c-Mpl
(the TPO receptor) and that depending on how they were wound up
against each other in the receptor context, the two TMP molecules could
be tethered together in the C- to N-terminus configuration in a way that
would not perturb the global dimeric conformation. Clearly, the success
of the design of tandem linked repeats depends on proper selection of the
length and composition of the linker that joins the C- and N-termini of the
two sequentially aligned TMP monomers. Since no structural information
of the TMP bound to c-Mpl was available, a series of repeated peptides
with linkers composed of 0 to 10 and 14 glycine residues (Table A) were
synthesized. Glycine was chosen because of its simplicity and flexibility,
based on the rationale that a flexible polyglycine peptide chain might
allow for the free folding of the two tethered TMP repeats into the
required conformation, while other amino acid sequences may adopt
undesired secondary structures whose rigidity might disrupt the correct
packing of the repeated peptide in the receptor context.
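
The linker series described above is easy to enumerate. The following Python sketch is illustrative only; the helper name `tandem_repeat` is hypothetical, and the TMP monomer sequence is the 14-residue sequence IEGPTLRQWLAARA given in Table 21. It writes out the TMP-(G)n-TMP sequences for n = 0 to 10 and n = 14.

```python
TMP = "IEGPTLRQWLAARA"                     # TMP monomer, 14 residues
LINKER_LENGTHS = list(range(0, 11)) + [14]  # n = 0..10 and 14

def tandem_repeat(monomer, n_gly):
    """Join two copies of the monomer C- to N-terminus through n_gly glycines."""
    return monomer + "G" * n_gly + monomer

constructs = {n: tandem_repeat(TMP, n) for n in LINKER_LENGTHS}

for n, seq in constructs.items():
    print(f"n = {n:2d}  length = {len(seq):2d}  {seq}")
```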

The resulting peptides are readily accessible by conventional solid
phase peptide synthesis methods (Merrifield (1963), J. Amer. Chem. Soc.
85: 2149) with either Fmoc or t-Boc chemistry. Unlike the synthesis of the
C-terminally linked parallel dimer, which required the use of an
orthogonally protected lysine residue as the initial branch point to build
the two peptide chains in a pseudosymmetrical way (Cwirla et al. (1997),
Science 276: 1696-9), the synthesis of these tandem repeats was a
straightforward, stepwise assembly of the continuous peptide chains from
the C- to N-terminus. Since dimerization of TMP had a more dramatic
effect on the proliferative activity than binding affinity as shown for the
C-terminal dimer (Cwirla et al. (1997)), the synthetic peptides were tested
directly for biological activity in a TPO-dependent cell-proliferation assay
using an IL-3 dependent clone of murine 32D cells transfected with the
full-length c-Mpl (Palacios et al., Cell 41:727 (1985)). As the test results
showed, all the polyglycine linked tandem repeats demonstrated >1000
fold increases in potency as compared to the monomer, and were even
more potent than the C-terminal dimer in this cell proliferation assay. The
absolute activity of the C-terminal dimer in our assay was lower than that
of the native TPO protein, which is different from the previously reported
findings in which the C-terminal dimer was found to be as active as the
natural ligand (Cwirla et al. (1997)). This might be due to differences in
the conditions used in the two assays. Nevertheless, the difference in
activity between tandem (C terminal of first monomer linked to N
terminal of second monomer) and C-terminal (C terminal of first monomer
linked to C terminal of second monomer; also referred to as parallel)
dimers in the same assay clearly demonstrated the superiority of the tandem
repeat strategy over parallel peptide dimerization. It is interesting to note
that a wide range of lengths is tolerated by the linker. The optimal linker
between tandem peptides with the selected TMP monomers apparently is
composed of 8 glycines.

Other tandem repeats. Subsequent to this first series of TMP
tandem repeats, several other molecules were designed either with
different linkers or containing modifications within the monomer itself.
The first of these molecules, peptide 13, has a linker composed of GPNG, a
sequence known to have a high propensity to form a β-turn-type
secondary structure. Although still about 100-fold more potent than the
monomer, this peptide was found to be >10-fold less active than the
equivalent GGGG-linked analog. Thus, introduction of a relatively rigid
β-turn at the linker region seemed to have caused a slight distortion of the
optimal agonist conformation in this short linker form.

The Trp9 in the TMP sequence is a highly conserved residue among
the active peptides isolated from random peptide libraries. There is also a
highly conserved Trp in the consensus sequences of EPO mimetic peptides
and this Trp residue was found to be involved in the formation of a
hydrophobic core between the two EMPs and contributed to hydrophobic
interactions with the EPO receptor (Livnah et al. (1996), Science 273:
464-71). By analogy, the Trp9 residue in TMP might have a similar function in
dimerization of the peptide ligand, and as an attempt to modulate and
estimate the effects of noncovalent hydrophobic forces exerted by the two
indole rings, several analogs were made resulting from mutations at the
Trp. So in peptide 14, the Trp residue was replaced in each of the two
TMP monomers with a Cys, and an intramolecular disulfide bond was
formed between the two cysteines by oxidation, which was envisioned to
mimic the hydrophobic interactions between the two Trp residues in
peptide dimerization. Peptide 15 is the reduced form of peptide 14. In
peptide 16, the two Trp residues were replaced by Ala. As the assay data
show, all three analogs were inactive. These data further demonstrated
that Trp is critical for the activity of the TPO mimetic peptide, not just for
dimer formation.

The next two peptides (peptides 17a and 18) each contain in their
8-amino acid linker a Lys or Cys residue. These two compounds are
precursors to the two PEGylated peptides (peptides 19 and 20) in which the
side chain of the Lys or Cys is modified by a PEG moiety. A PEG moiety
was introduced at the middle of a relatively long linker, so that the large
PEG component (5 kDa) is far enough away from the critical binding sites
in the peptide molecule. PEG is a known biocompatible polymer which is
increasingly used as a covalent modifier to improve the pharmacokinetic
profiles of peptide- and protein-based therapeutics.

A modular, solution-based method was devised for convenient
PEGylation of synthetic or recombinant peptides. The method is based on
the now well established chemoselective ligation strategy which utilizes
the specific reaction between a pair of mutually reactive functionalities.
So, for pegylated peptide 19, the lysine side chain was preactivated with a
bromoacetyl group to give peptide 17b to accommodate reaction with a
thiol-derivatized PEG. To do that, an orthogonal protecting group, Dde,
was employed for the protection of the lysine ε-amine. Once the whole
peptide chain was assembled, the N-terminal amine was reprotected with
t-Boc. Dde was then removed to allow for the bromoacetylation. This
strategy gave a high quality crude peptide which was easily purified using
conventional reverse phase HPLC. Ligation of the peptide with the
thiol-modified PEG took place in aqueous buffer at pH 8 and the reaction
was complete within 30 minutes. MALDI-MS analysis of the purified,
pegylated material revealed a characteristic, bell-shaped spectrum with an
increment of 44 Da between the adjacent peaks. For PEG-peptide 20, a
cysteine residue was placed in the linker region and its side chain thiol
group would serve as an attachment site for a maleimide-containing PEG.
Similar conditions were used for the pegylation of this peptide. As the
assay data revealed, these two pegylated peptides had even higher in vitro
bioactivity as compared to their unpegylated counterparts.
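
The 44 Da spacing reported for the MALDI-MS spectrum corresponds to one ethylene oxide repeat unit (C2H4O, average mass about 44.05 Da). The short Python sketch below illustrates why a polydisperse PEG conjugate gives a ladder of evenly spaced peaks. It is a hedged illustration only: `base_mass` is a hypothetical placeholder standing for everything in the conjugate other than the repeat units, and the range of chain lengths is assumed for the example rather than taken from the text.

```python
ETHYLENE_OXIDE_AVG_MASS = 44.05  # Da, average mass of one -CH2CH2O- unit

def peg_ladder(base_mass, n_min, n_max):
    """Masses of conjugates carrying n_min..n_max ethylene oxide units."""
    return [base_mass + n * ETHYLENE_OXIDE_AVG_MASS
            for n in range(n_min, n_max + 1)]

# Rough number of repeat units in a nominal 5 kDa PEG.
approx_units = round(5000 / ETHYLENE_OXIDE_AVG_MASS)

# Hypothetical placeholder mass for the peptide plus PEG end groups.
base_mass = 3500.0
masses = peg_ladder(base_mass, approx_units - 3, approx_units + 3)
spacings = [later - earlier for earlier, later in zip(masses, masses[1:])]

print(approx_units)
print([round(s, 2) for s in spacings])
```

Each adjacent pair of species differs by exactly one repeat unit, so the peak spacing is independent of the peptide mass; the bell shape of the envelope reflects the distribution of chain lengths in the PEG reagent.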



Peptide 21 has in its 8-amino acid linker a potential glycosylation
motif, NGS. Since our exemplary tandem repeats are made up of natural
amino acids linked by peptide bonds, expression of such a molecule in an
appropriate eukaryotic cell system should produce a glycopeptide with
the carbohydrate moiety added on the side chain carboxyamide of Asn.
Glycosylation is a common post-translational modification process which
can have many positive impacts on the biological activity of a given
protein by increasing its aqueous solubility and in vivo stability. As the
assay data show, incorporation of this glycosylation motif into the linker
maintained high bioactivity. The synthetic precursor of the potential
glycopeptide had in effect an activity comparable to that of the
-(G)8-linked analog. Once glycosylated, this peptide is expected to have the
same order of activity as the pegylated peptides, because of the similar
chemophysical properties exhibited by a PEG and a carbohydrate moiety.

The last peptide is a dimer of a tandem repeat. It was prepared by
oxidizing peptide 18, which formed an intermolecular disulfide bond
between the two cysteine residues located at the linker. This peptide was
designed to address the possibility that TMP was active as a tetramer. The
assay data showed that this peptide was not more active than an average
tandem repeat on an adjusted molar basis, which indirectly supports the
idea that the active form of TMP is indeed a dimer; otherwise dimerization
of a tandem repeat would have a further impact on the bioactivity.

In order to confirm the in vitro data in animals, one pegylated TMP
tandem repeat (compound 20 in Table A) was delivered subcutaneously to
normal mice via osmotic pumps. Time and dose-dependent increases
were seen in platelet numbers for the duration of treatment. Peak platelet
levels over 4-fold baseline were seen on day 8. A dose of 10 µg/kg/day of
the pegylated TMP repeat produced a similar response to rHuMGDF
(non-pegylated) at 100 µg/kg/day delivered by the same route.

Table A— TPO-mimetic Peptides

Peptide                                                      SEQ ID
No.      Compound                                            NO:

         TPO
         TMP monomer                                         13
         TMP C-C dimer
         TMP-(G)n-TMP:
1          n = 0                                             341
2          n = 1                                             342
3          n = 2                                             343
4          n = 3                                             344
5          n = 4                                             345
6          n = 5                                             346
7          n = 6                                             347
8          n = 7                                             348
9          n = 8                                             349
10         n = 9                                             350
11         n = 10                                            351
12         n = 14                                            352
13       TMP-GPNG-TMP                                        353
14       IEGPTLRQCLAARA-GGGGGGGG-IEGPTLRQCLAARA (cyclic)     354
15       IEGPTLRQCLAARA-GGGGGGGG-IEGPTLRQCLAARA (linear)     355
16       IEGPTLRQALAARA-GGGGGGGG-IEGPTLRQALAARA              356
17a      TMP-GGGKGGGG-TMP                                    357
17b      TMP-GGGK(BrAc)GGGG-TMP                              358
18       TMP-GGGCGGGG-TMP                                    359
19       TMP-GGGK(PEG)GGGG-TMP                               360
20       TMP-GGGC(PEG)GGGG-TMP                               361
21       TMP-GGGN*GSGG-TMP                                   362
22       TMP-GGGCGGGG-TMP                                    363
                |
         TMP-GGGCGGGG-TMP                                    363

[Relative Potency column: row assignments not recoverable; surviving
entries are +, +++-, ++++ and ND.]

Discussion. It is well accepted that MGDF acts in a way similar to
hGH, i.e., one molecule of the protein ligand binds two molecules of the
receptor for its activation. Wells et al. (1996), Ann. Rev. Biochem. 65:
609-34. Now, this interaction is mimicked by the action of a much smaller
peptide, TMP. However, the present studies suggest that this mimicry
requires the concerted action of two TMP molecules, as covalent
dimerization of TMP in either a C-C parallel or C-N sequential fashion
increased the in vitro biological potency of the original monomer by a
factor of greater than 10^3. The relatively low biopotency of the monomer is
probably due to inefficient formation of the noncovalent dimer. A
preformed covalent repeat has the ability to eliminate the entropy barrier
for the formation of a noncovalent dimer which is exclusively driven by
weak, noncovalent interactions between two molecules of the small,
14-residue peptide.

It is intriguing that this tandem repeat approach had a similar effect
on enhancing bioactivity as the reported C-C dimerization.
These two strategies brought about two very different molecular
configurations. The C-C dimer is a quasi-symmetrical molecule, while the
tandem repeats have no such symmetry in their linear structures. Despite
this difference in their primary structures, these two types of molecules
appeared able to fold effectively into a similar biologically active
conformation and cause the dimerization and activation of c-Mpl. These
experimental observations provide a number of insights into how the two
TMP molecules may interact with one another in binding to c-Mpl. First,
the two C-termini of the two bound TMP molecules must be in relatively
close proximity with each other, as suggested by data on the C-terminal
dimer. Second, the respective N- and C-termini of the two TMP molecules
in the receptor complex must also be very closely aligned with each other,
such that they can be directly tethered together with a single peptide bond
to realize the near maximum activity-enhancing effect brought about by
the tandem repeat strategy. Insertion of one or more (up to 14) glycine
residues at the junction did not significantly increase (or decrease) the
activity any further. This may be due to the fact that a flexible polyglycine
peptide chain is able to loop out easily from the junction without causing
any significant changes in the overall conformation. This flexibility seems
to provide the freedom of orientation for the TMP peptide chains to fold
into the required conformation in interacting with the receptor and
validates the linker as a site of modification. Indirect evidence supporting this
came from the study on peptide 13, in which a much more rigid
β-turn-forming sequence as the linker apparently forced a deviation of the
backbone alignment around the linker which might have resulted in a
slight distortion of the optimal conformation, thus resulting in a moderate
(10-fold) decrease in activity as compared with the analogous compound
with a 4-Gly linker. Third, Trp9 in TMP plays a similar role as Trp13 in
EMP, which is involved not only in peptide:peptide interaction for the
formation of dimers but also is important for contributing hydrophobic
forces in peptide:receptor interaction. Results obtained with the W to C
mutant analog, peptide 14, suggest that a covalent disulfide linkage is not
sufficient to approximate the hydrophobic interactions provided by the
Trp pair and that, being a short linkage, it might bring the two TMP
monomers too close, therefore perturbing the overall conformation of the
optimal dimeric structure.

An analysis of the possible secondary structure of the TMP peptide
can provide further understanding of the interaction between TMP and
c-Mpl. This can be facilitated by making reference to the reported structure
of the EPO mimetic peptide. Livnah et al. (1996), Science 273: 464-71. The
receptor-bound EMP has a β-hairpin structure with a β-turn formed by the
highly conserved Gly-Pro-Leu-Thr consensus at the center of its sequence.
Instead of GPLT, TMP has a highly selected GPTL sequence which is likely
to form a similar turn. However, this turn-like motif is located near the
N-terminal part in TMP. Secondary structure prediction using the
Chou-Fasman method suggests that the C-terminal half of the peptide has a
tendency to adopt a helical conformation. Together with the highly
conserved Trp at position 9, this C-terminal helix may contribute to the
stabilization of the dimeric structure. It is interesting to note that most of
our tandem repeats are more potent than the C-terminal parallel dimer.
Tandem repeats seem to give the molecule a better fit conformation than
does the C-C parallel dimerization. The seemingly asymmetric feature of a
tandem repeat might have brought it closer to the natural ligand which, as
an asymmetric molecule, uses two different sites to bind two identical
receptor molecules.

Introduction of a PEG moiety was envisaged to enhance the in vivo
activity of the modified peptide by providing it with protection against
proteolytic degradation and by slowing down its clearance through renal
filtration. It was unexpected that pegylation could further increase the in
vitro bioactivity of a tandem repeated TMP peptide in the cell-based
proliferation assay.

Example 2
Fc-TMP fusions

TMPs (and EMPs as described in Example 3) were expressed in
either monomeric or dimeric form as either N-terminal or C-terminal
fusions to the Fc region of human IgG1. In all cases, the expression
construct utilized the luxPR promoter in the plasmid expression
vector pAMG21.

Fc-TMP. A DNA sequence coding for the Fc region of human IgG1
fused in-frame to a monomer of the TPO-mimetic peptide was constructed
using standard PCR technology. Templates for PCR reactions were the
pFc-A3 vector and a synthetic TMP gene. The synthetic gene was
constructed from the 3 overlapping oligonucleotides (SEQ ID NOS: 364,
365, and 366, respectively) shown below:

1842-97  AAA AAA GGA TCC TCG AGA TTA AGC ACG AGC AGC CAG CCA
         CTG ACG CAG AGT CGG ACC

1842-98  AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG ACT CTG CGT

1842-99  CAG TGG CTG GCT GCT CGT GCT TAA TCT CGA GGA TCC TTT
         TTT

These oligonucleotides were annealed to form the duplex encoding an
amino acid sequence (SEQ ID NOS: 367 and 368, respectively) shown
below:

     AAAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCT
   1 ---------+---------+---------+---------+---------+---------+ 60
                             CCAGGCTGAGACGCAGTCACCGACCGACGAGCACGA
     K  G  G  G  G  G  I  E  G  P  T  L  R  Q  W  L  A  A  R  A

     TAATCTCGAGGATCCTTTTTT
  61 ---------+---------+- 81
     ATTAGAGCTCCTAGGAAAAAA
     *

This duplex was amplified in a PCR reaction using 1842-98 and 1842-97 as 
the sense and antisense primers. 
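
The relationship between the three oligonucleotides and the encoded peptide can be checked mechanically. The following Python sketch is illustrative only: the helper names `reverse_complement` and `translate` are hypothetical, and the standard genetic code is assumed. It joins 1842-98 and 1842-99 into the sense strand, translates that strand up to the first stop codon, and checks that 1842-97 is the reverse complement of the 3' portion of the sense strand.

```python
# Oligonucleotides as listed above; spaces are removed for processing.
OLIGO_1842_97 = ("AAA AAA GGA TCC TCG AGA TTA AGC ACG AGC AGC CAG CCA "
                 "CTG ACG CAG AGT CGG ACC").replace(" ", "")
OLIGO_1842_98 = ("AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG ACT CTG "
                 "CGT").replace(" ", "")
OLIGO_1842_99 = ("CAG TGG CTG GCT GCT CGT GCT TAA TCT CGA GGA TCC TTT "
                 "TTT").replace(" ", "")

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

sense_strand = OLIGO_1842_98 + OLIGO_1842_99
peptide = translate(sense_strand)
antisense_matches = sense_strand.endswith(reverse_complement(OLIGO_1842_97))

print(len(sense_strand), peptide, antisense_matches)
```

The translated product is a Lys followed by five glycines and the TMP monomer, which agrees with the amino acid line printed under the duplex.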

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers shown below (SEQ ID NOS: 369 and 370): 

1216-52 AAC ATA AGT ACC TGT AGG ATC G 

1830-51 TTCGATACCA CCACCTCCAC CTTTACCCGG AGACAGGGAG AGGCTCTTCTGC 

The oligonucleotides 1830-51 and 1842-98 contain an overlap of 24 
nucleotides, allowing the two genes to be fused together in the correct 
reading frame by combining the above PCR products in a third reaction 
using the outside primers, 1216-52 and 1842-97. 
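
The 24-nucleotide overlap stated above can be confirmed from the two primer sequences. The sketch below is an illustration, with the hypothetical helper `junction_overlap`: it takes the antisense primer of the upstream (Fc) product, converts it to the sense strand, and looks for the longest stretch at its 3' end that is identical to the 5' end of the sense primer of the downstream (TMP) product.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def junction_overlap(upstream_antisense_primer, downstream_sense_primer):
    """Longest 3' suffix of the upstream sense strand that equals a 5' prefix
    of the downstream sense primer; returns its length in nucleotides."""
    upstream_sense_end = reverse_complement(upstream_antisense_primer)
    longest = min(len(upstream_sense_end), len(downstream_sense_primer))
    for k in range(longest, 0, -1):
        if upstream_sense_end[-k:] == downstream_sense_primer[:k]:
            return k
    return 0

# Primer sequences as listed above; spaces are removed for processing.
PRIMER_1830_51 = ("TTCGATACCA CCACCTCCAC CTTTACCCGG AGACAGGGAG "
                  "AGGCTCTTCTGC").replace(" ", "")
PRIMER_1842_98 = ("AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG ACT CTG "
                  "CGT").replace(" ", "")

overlap = junction_overlap(PRIMER_1830_51, PRIMER_1842_98)
print(overlap)
```

Because the shared region is a whole number of codons (24 nucleotides, 8 codons), joining the two PCR products through it keeps the peptide in the same reading frame as the Fc coding sequence.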

The final PCR gene product (the full length fusion gene) was
digested with restriction endonucleases XbaI and BamHI, and then ligated
into the vector pAMG21 and transformed into competent E. coli strain
2596 cells as described for EMP-Fc herein. Clones were screened for the
ability to produce the recombinant protein product and to possess the
gene fusion having the correct nucleotide sequence. A single such clone
was selected and designated Amgen strain #3728.

The nucleotide and amino acid sequences (SEQ ID NOS: 5 and 6) of
the fusion protein are shown in Figure 7.

Fc-TMP-TMP. A DNA sequence coding for the Fc region of human
IgG1 fused in-frame to a dimer of the TPO-mimetic peptide was
constructed using standard PCR technology. Templates for PCR reactions
were the pFc-A3 vector and a synthetic TMP-TMP gene. The synthetic
gene was constructed from the 4 overlapping oligonucleotides (SEQ ID
NOS: 371 to 374, respectively) shown below:

1830-52  AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG
         ACT CTG CGT CAG TGG CTG GCT GCT CGT GCT

1830-53  ACC TCC ACC ACC AGC ACG AGC AGC CAG
         CCA CTG ACG CAG AGT CGG ACC

1830-54  GGT GGT GGA GGT GGC GGC GGA GGT ATT GAG GGC CCA ACC
         CTT CGC CAA TGG CTT GCA GCA CGC GCA

1830-55  AAA AAA AGG ATC CTC GAG ATT ATG CGC GTG CTG CAA GCC
         ATT GGC GAA GGG TTG GGC CCT CAA TAC CTC CGC CGC C

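The 4 oligonucleotides were annealed to form the duplex encoding an
amino acid sequence (SEQ ID NOS: 375 and 376, respectively) shown
below:

     AAAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCT
   1 ---------+---------+---------+---------+---------+---------+ 60
                             CCAGGCTGAGACGCAGTCACCGACCGACGAGCACGA
     K  G  G  G  G  G  I  E  G  P  T  L  R  Q  W  L  A  A  R  A

     GGTGGTGGAGGTGGCGGCGGAGGTATTGAGGGCCCAACCCTTCGCCAATGGCTTGCAGCA
  61 ---------+---------+---------+---------+---------+---------+ 120
     CCACCACCTCCACCGCCGCCTCCATAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGT
     G  G  G  G  G  G  G  G  I  E  G  P  T  L  R  Q  W  L  A  A

     CGCGCATAATCTCGAGGATCCTTTTTTT
 121 ---------+---------+-------- 148
     GCGCGTATTAGAGCTCCTAGGAAAAAAA
     R  A  *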

This duplex was amplified in a PCR reaction using 1830-52 and 1830-55 as
the sense and antisense primers.

The Fc portion of the molecule was generated in a PCR reaction
with pFc-A3 using the primers 1216-52 and 1830-51 as described above for
Fc-TMP. The full length fusion gene was obtained from a third PCR
reaction using the outside primers 1216-52 and 1830-55.

The final PCR gene product (the full length fusion gene) was
digested with restriction endonucleases XbaI and BamHI, and then ligated
into the vector pAMG21 and transformed into competent E. coli strain
2596 cells as described in Example 1. Clones were screened for the ability
to produce the recombinant protein product and to possess the gene
fusion having the correct nucleotide sequence. A single such clone was
selected and designated Amgen strain #3727.

The nucleotide and amino acid sequences (SEQ ID NOS: 7 and 8) of
the fusion protein are shown in Figure 8.

TMP-TMP-Fc. A DNA sequence coding for a tandem repeat of the
TPO-mimetic peptide fused in-frame to the Fc region of human IgG1 was
constructed using standard PCR technology. Templates for PCR reactions
were the EMP-Fc plasmid from strain #3688 (see Example 3) and a
synthetic gene encoding the TMP dimer. The synthetic gene for the
tandem repeat was constructed from the 7 overlapping oligonucleotides
shown below (SEQ ID NOS: 377 to 383, respectively):

1885-52  TTT TTT CAT ATG ATC GAA GGT CCG ACT CTG CGT CAG TGG

1885-53  AGC ACG AGC AGC CAG CCA CTG ACG CAG AGT CGG ACC TTC
         GAT CAT ATG

1885-54  CTG GCT GCT CGT GCT GGT GGA GGC GGT GGG GAC AAA ACT

1885-55  CTG GCT GCT CGT GCT GGC GGT GGT GGC GGA GGG GGT GGC
         ATT GAG GGC CCA

1885-56  AAG CCA TTG GCG AAG GGT TGG GCC CTC AAT GCC ACC CCC
         TCC GCC ACC ACC GCC

1885-57  ACC CTT CGC CAA TGG CTT GCA GCA CGC GCA GGG GGA GGC
         GGG GAC AAA ACT

1885-58  CCC ACC GCC TCC CCC TGC GCG TGC TGC

TTTTTTCATATGATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTGGCGGT 

1 + + + + + + 60 

GTATACTAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGACCGCCA 
a MIEGPTLRQWLAARAGG 

GGTGGCGGAGGGGGTGGCATTGAGGGCCCAACCCTTCGCCAATGGCTGGCTGCTCGTGCT 

61 + + + + + + 120 

CCACCGCCTCCCCCACCGTAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGTGCGCGT 
a GGGGGGIEGPTLRQWLAA RA 

GGTGGAGGCGGTGGGGACAAAACTCTGGCTGCTCGTGCTGGTGGAGGCGGTGGGGACAAA 

121 .-+ + + + + + 180 

CCCCCTCCGCCACCC 

a GGGGGDKTLAARAGGGGGDK 

ACTCACACA 
181 189 



a T H T 

This duplex was amplified in a PCR reaction using 1885-52 and 1885-58 as
the sense and antisense primers.

The Fc portion of the molecule was generated in a PCR reaction
with DNA from the EMP-Fc fusion strain #3688 (see Example 3) using the
primers 1885-54 and 1200-54. The full length fusion gene was obtained
from a third PCR reaction using the outside primers 1885-52 and 1200-54.

The final PCR gene product (the full length fusion gene) was
digested with restriction endonucleases XbaI and BamHI, and then ligated
into the vector pAMG21 and transformed into competent E. coli strain
2596 cells as described for Fc-EMP herein. Clones were screened for the
ability to produce the recombinant protein product and to possess the
gene fusion having the correct nucleotide sequence. A single such clone
was selected and designated Amgen strain #3798.

The nucleotide and amino acid sequences (SEQ ID NOS: 9 and 10)
of the fusion protein are shown in Figure 9.

TMP-Fc. A DNA sequence coding for a monomer of the TPO-mimetic
peptide fused in-frame to the Fc region of human IgG1 was
obtained fortuitously in the ligation in TMP-TMP-Fc, presumably due to
the ability of primer 1885-54 to anneal to 1885-53 as well as to 1885-58. A
single clone having the correct nucleotide sequence for the TMP-Fc
construct was selected and designated Amgen strain #3788.

The nucleotide and amino acid sequences (SEQ ID NOS: 11 and 12)
of the fusion protein are shown in Figure 10.

Expression in E. coli. Cultures of each of the pAMG21-Fc-fusion
constructs in E. coli GM221 were grown at 37 °C in Luria Broth medium
containing 50 µg/ml kanamycin. Induction of gene product expression
from the luxPR promoter was achieved following the addition of the
synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to the
culture media to a final concentration of 20 ng/ml. Cultures were
incubated at 37 °C for a further 3 hours. After 3 hours, the bacterial
cultures were examined by microscopy for the presence of inclusion
bodies and were then collected by centrifugation. Refractile inclusion
bodies were observed in induced cultures, indicating that the Fc-fusions
were most likely produced in the insoluble fraction in E. coli. Cell pellets
were lysed directly by resuspension in Laemmli sample buffer containing
10% β-mercaptoethanol and were analyzed by SDS-PAGE. In each case, an
intense Coomassie-stained band of the appropriate molecular weight was
observed on an SDS-PAGE gel.
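
The induction step above fixes only the final autoinducer concentration (20 ng/ml). As a rough illustration of the dilution arithmetic, the following Python sketch applies C1·V1 = C2·V2 and ignores the small volume added. The culture volume and stock concentration are hypothetical values chosen for the example; neither is given in the text.

```python
def stock_volume_ul(final_ng_per_ml, culture_ml, stock_ng_per_ml):
    """Microliters of stock needed for a target final concentration (C1*V1 = C2*V2)."""
    if stock_ng_per_ml <= final_ng_per_ml:
        raise ValueError("stock must be more concentrated than the target")
    volume_ml = final_ng_per_ml * culture_ml / stock_ng_per_ml
    return volume_ml * 1000.0  # convert ml to microliters


# 20 ng/ml comes from the text; the 50 ml culture and the
# 1 mg/ml (1,000,000 ng/ml) stock are assumptions for illustration only.
print(stock_volume_ul(20.0, 50.0, 1_000_000.0))
```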

pAMG21. The expression plasmid pAMG21 can be derived from
the Amgen expression vector pCFM1656 (ATCC #69576), which in turn can be
derived from the Amgen expression vector system described in US Patent
No. 4,710,473. The pCFM1656 plasmid can be derived from the described
pCFM836 plasmid (Patent No. 4,710,473) by:

(a) destroying the two endogenous NdeI restriction sites by end
filling with T4 polymerase enzyme followed by blunt end
ligation;

(b) replacing the DNA sequence between the unique AatII and ClaI
restriction sites containing the synthetic PL promoter with a
similar fragment obtained from pCFM636 (Patent No. 4,710,473)
containing the PL promoter (see SEQ ID NO: 386 below); and

(c) substituting the small DNA sequence between the unique ClaI
and KpnI restriction sites with the oligonucleotide having the
sequence of SEQ ID NO: 388.

SEQ ID NO: 386:
AatII

5' CTAATTCCGCTCTCACCTACCAAACAATGCCCCCCTGCAAAAAATAAATTCATAT-
3' TGCAGATTAAGGCGAGAGTGGATGGTTTGTTACGGGGGGACGTTTTTTATTTAAGTATA-

-AAAAAACATACAGATAACCATCTGCGGTGATAAATTATCTCTGGCGGTGTTGACATAAA-
-TTTTTTGTATGTCTATTGGTAGACGCCACTATTTAATAGAGACCGCCACAACTGTATTT-

-TACCACTGGCGGTGATACTGAGCACAT 3'
-ATGGTGACCGCCACTATGACTCGTGTAGC 5'

ClaI

SEQ ID NO: 387:

5' CGATTTGATTCTAGAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGGTAC 3'
3' TAAACTAAGATCTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGC 5'

ClaI KpnI
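
A quick way to sanity-check a printed duplex such as SEQ ID NO: 387 is to confirm that the lower strand is the base-by-base complement of the paired part of the upper strand. The Python sketch below does this. The 2-nucleotide left offset is an assumption read off the printed alignment (the upper strand's leading CG and trailing GTAC are taken to be the single-stranded ClaI and KpnI ends); the function name is illustrative only.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def paired_region_is_complementary(top_5to3, bottom_3to5, left_overhang):
    """True if the bottom strand (written 3'->5') pairs with the top strand
    starting `left_overhang` bases in from the top strand's 5' end."""
    paired_top = top_5to3[left_overhang:left_overhang + len(bottom_3to5)]
    if len(paired_top) != len(bottom_3to5):
        return False
    return paired_top.translate(COMPLEMENT) == bottom_3to5


# Strands copied from SEQ ID NO: 387 as printed above.
top = "CGATTTGATTCTAGAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGGTAC"
bottom = "TAAACTAAGATCTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGC"

# Assumed alignment: 2 unpaired bases at the left end of the top strand.
print(paired_region_is_complementary(top, bottom, left_overhang=2))
print(top[2 + len(bottom):])  # unpaired 3' tail of the top strand
```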



The expression plasmid pAMG21 can then be derived from pCFM1656 by
making a series of site-directed base changes by PCR overlapping oligo
mutagenesis and DNA sequence substitutions. Starting with the BglII site
(plasmid bp # 180) immediately 5' to the plasmid replication promoter
PcopB and proceeding toward the plasmid replication genes, the base pair
changes are as shown in Table B below.



Table B — Base pair changes resulting in pAMG21

pAMG21 bp #     bp in pCFM1656        bp changed to in pAMG21

# 204           T/A                   C/G
# 428           A/T                   G/C
# 509           G/C                   A/T
# 617           --                    insert two G/C bp
# 679           G/C                   T/A
# 980           T/A                   C/G
# 994           G/C                   A/T
# 1004          A/T                   C/G
# 1007          C/G                   T/A
# 1028          A/T                   T/A
# 1047          C/G                   T/A
# 1178          G/C                   T/A
# 1466          G/C                   T/A
# 2028          G/C                   bp deletion
# 2187          C/G                   T/A
# 2480          A/T                   T/A
# 2499-2502     AGTG/TCAC             GTCA/CAGT
# 2642          TCCGAGC/AGGCTCG       7 bp deletion
# 3435          G/C                   A/T
# 3446          G/C                   A/T
# 3643          A/T                   T/A
The DNA sequence between the unique AatII (position #4364 in
pCFM1656) and SacII (position #4585 in pCFM1656) restriction sites is
substituted with the DNA sequence (SEQ ID NO: 23) shown in Figures
17A and 17B. During the ligation of the sticky ends of this substitution
DNA sequence, the outside AatII and SacII sites are destroyed. There are
unique AatII and SacII sites in the substituted DNA.

GM221 (Amgen #2596). The Amgen host strain #2596 is an E. coli K-12
strain derived from Amgen strain #393. It has been modified to contain
both the temperature sensitive lambda repressor cI857s7 in the early ebg
region and the lacIQ repressor in the late ebg region (68 minutes). The
presence of these two repressor genes allows the use of this host with a
variety of expression systems; however, both of these repressors are
irrelevant to the expression from luxPR. The untransformed host has no
antibiotic resistances.

The ribosome binding site of the cI857s7 gene has been modified to
include an enhanced RBS. It has been inserted into the ebg operon
between nucleotide position 1170 and 1411 as numbered in Genbank
accession number M64441Gb_Ba with deletion of the intervening ebg
sequence. The sequence of the insert is shown below, with lower case
letters representing the ebg sequences flanking the insert
(SEQ ID NO: 388):

ttattttcgtGCGGCCGCACCATTATCACCGCCAGAGGTAAACTAGTCAACACGCACGGTGTTAGATATTTAT 
CCC TTGC GGTGATAGATTGAGC AC ATCGATTTGATTCTAGAAGGAGGGATAATATATGAGC AC AAAAAAGAAA 
C C ATTAACAC AAGAGC AGC TTGAGGAC GC ACGTCGC C TTAAAGC AAT TTATG AAAAAAAGAAAAAT GAAC T TG 

2 5 GCTTATCCCAGGMTCTGTCGCAGACAAGATGGGGATGGGGCAGTCAGGCGTTGGTGCTTTATTTAATGGCAT 

CAATGCATTAAATGCTTATAACGCCGCATTGCTTACAAAAATTCTCAAAGTTAGCGTTGAAGAATTTAGCCCT 
TCAATCGCCAGAGAATCTACGAGATGTATGAAGCGGTTAGTATGC^GCCGTCACTTAGAAGTGAGTATGAGTA 
CCCTGTTTTTTCTCATGTTCAGGCAGGGATGTTCTCACCTAAGCTTAGMCCTTTACCAAAGGTGATGCGGAG 
AGATGGGTMGCACAACCAAAAAAGCCAGTGATTCTGCATTCTGGCTTGAGGTTGAAGGTAATTCCATGACCG 
30 CACCAACAGGCTCCAAGCCAAGCTTTCCTGACGGAATGTTAATTCTCGTTGACCCTGAGCAGGCTGTTGAGCC 
AGGTGATTTCTGCATAGCCAGACTTGGGGGTGATGAGTTTACCTTCAAGAAACTGATCAGGGATAGCGGTCAG 
GTGTTTTTACJVACCACTAAACCCACAGTACCCAATGATCCCATGCAATGAGAGTTGTTCCGTTGTGGGGAAAG 
TTATCGCTAGTCAGTGGCCTGAAGAGACGTTTGGCTGATAGACTAGTGGATCCACTAGTg 1 1 tc tgccc 

The construct was delivered to the chromosome using a
recombinant phage called MMebg-cI857s7enhanced RBS #4 into F'tet/393.
After recombination and resolution only the chromosomal insert described
above remains in the cell. It was renamed F'tet/GM101. F'tet/GM101 was
then modified by the delivery of a lacIQ construct into the ebg operon
between nucleotide position 2493 and 2937 as numbered in the Genbank
accession number M64441Gb_Ba with the deletion of the intervening ebg
sequence. The sequence of the insert is shown below, with the lower case
letters representing the ebg sequences flanking the insert (SEQ ID NO:
389):
w c 9 ,aaacc<^GTCCATC^^ 

1 0 GTTTCCCGCGTGGTGAACCAGGCCAGCCACG^TGCG^^ 

sssssssssss^^Bsssssssssssss 

GGiACGGGAAGGCGACTGGAGTGCCATC 
CCCACTGCC»TGCTGGTTGCC^CGATCAGATGGCGC 

GTAAGGTACCATAGGATCCaggcacagga 

The construct was delivered to the chromosome using a
recombinant phage called AGebg-LacIQ#5 into F'tet/GM101. After
recombination and resolution only the chromosomal insert described
above remains in the cell. It was renamed F'tet/GM221. The F'tet episome
was cured from the strain using acridine orange at a concentration of 25
µg/ml in LB. The cured strain was identified as tetracycline sensitive and
was stored as GM221.

Expression. Cultures of pAMG21-Fc-TMP-TMP in E. coli GM221 in
Luria Broth medium containing 50 µg/ml kanamycin were incubated at
37 °C prior to induction. Induction of Fc-TMP-TMP gene product
expression from the luxPR promoter was achieved following the addition
of the synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to
the culture media to a final concentration of 20 ng/ml and cultures were
incubated at 37 °C for a further 3 hours. After 3 hours, the bacterial
in the same buffer ranging from 150 mM NaCl to 400 mM NaCl. The peak
is pooled and filtered.

Characterization of Fc-TMP activity. The following is a summary of
in vivo data in mice with various compounds of this invention.

Mice: Normal female BDF1 approximately 10-12 weeks of age.

Bleed schedule: Ten mice per group treated on day 0, two groups
started 4 days apart for a total of 20 mice per group. Five mice were bled
at each time point; mice were bled a minimum of three times a week. Mice
were anesthetized with isoflurane and a total volume of 140-160 µl of blood
was obtained by puncture of the orbital sinus. Blood was counted on a
Technicon H1E blood analyzer running software for murine blood.
Parameters measured were white blood cells, red blood cells, hematocrit,
hemoglobin, platelets, and neutrophils.

Treatments: Mice were either injected subcutaneously for a bolus
treatment or implanted with 7-day micro-osmotic pumps for continuous
delivery. Subcutaneous injections were delivered in a volume of 0.2 ml.
Osmotic pumps were inserted into a subcutaneous incision made in the
skin between the scapulae of anesthetized mice. Compounds were diluted
in PBS with 0.1% BSA. All experiments included one control group,
labeled "carrier", that was treated with this diluent only. The
concentration of the test articles in the pumps was adjusted so that the
calibrated flow rate from the pumps gave the treatment levels indicated in
the graphs.
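
The adjustment of pump concentration to flow rate described above is a simple proportion: the daily dose for one animal divided by the volume the pump delivers per day. The Python sketch below illustrates it. The mouse body weight (20 g) and pump flow rate (1.0 µl/hour) are hypothetical values for illustration; the text does not state either.

```python
def pump_concentration_ug_per_ml(dose_ug_per_kg_day, body_weight_kg, flow_ul_per_hour):
    """Concentration to load so that the pump delivers the target daily dose."""
    daily_dose_ug = dose_ug_per_kg_day * body_weight_kg
    daily_volume_ml = flow_ul_per_hour * 24.0 / 1000.0
    return daily_dose_ug / daily_volume_ml


# Assumed values: 0.020 kg mouse, 1.0 ul/hour calibrated flow rate.
# The 100 ug/kg/day level is one of the doses discussed in this example.
print(round(pump_concentration_ug_per_ml(100.0, 0.020, 1.0), 2))
```

Because the relation is linear, a tenfold lower treatment level simply needs a tenfold more dilute loading solution at the same flow rate.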

Compounds: A dose titration of the compound was delivered to
mice in 7-day micro-osmotic pumps. Mice were treated with various
compounds at a single dose of 100 µg/kg in 7-day osmotic pumps. Some
of the same compounds were then given to mice as a single bolus injection.

Activity test results: The results of the activity experiments are
shown in Figures 11 and 12. In dose response assays using 7-day
micro-osmotic pumps, the maximum effect with the compound of SEQ
ID NO: 18 was seen at 100 µg/kg/day; the 10 µg/kg/day dose was about 50%
maximally active and 1 µg/kg/day was the lowest dose at which activity
could be seen in this assay system. The compound at the 10 µg/kg/day dose
was about equally active as 100 µg/kg/day unpegylated rHu-MGDF in
the same experiment.
Example 3
Fc-EMP fusions

Fc-EMP. A DNA sequence coding for the Fc region of human IgG1
fused in-frame to a monomer of the EPO-mimetic peptide was constructed
using standard PCR technology. Templates for PCR reactions were a
vector containing the Fc sequence (pFc-A3, described in International
application WO 97/23614, published July 3, 1997) and a synthetic gene
encoding EPO monomer. The synthetic gene for the monomer was
constructed from the 4 overlapping oligonucleotides (SEQ ID NOS: 390 to
393, respectively) shown below:

1798-2 TAT GAA AGG TGG AGG TGG TGG TGG AGG TAC TTA CTC TTG
       CCA CTT CGG CCC GCT GAC TTG G

1798-3 CGG TTT GCA AAC CCA AGT CAG CGG GCC GAA GTG GCA AGA
       GTA AGT ACC TCC ACC ACC ACC TCC ACC TTT CAT

1798-4 GTT TGC AAA CCG CAG GGT GGC GGC GGC GGC GGC GGT GGT
       ACC TAT TCC TGT CAT TTT

1798-5 CCA GGT CAG CGG GCC AAA ATG ACA GGA ATA GGT ACC ACC
       GCC GCC GCC GCC GCC ACC CTG



The 4 oligonucleotides were annealed to form the duplex encoding an
amino acid sequence (SEQ ID NOS: 394 and 395, respectively) shown
below:

TATGAAAGGTGGAGGTGGTGGTGGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTG
1 + + + + + + 60
TACTTTCCACCTCCACCACCACCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAAC
b MKGGGGGGGTYSCHFGPLTW-

GGTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTT
CCAAACGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCGACTGGACC
b VCKPQGGGGGGGGTYSCHF
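
The reading frame of the synthetic gene can be checked by translating the sense oligonucleotide directly. The following Python sketch builds the standard genetic code table and translates oligonucleotide 1798-2 starting at its second base (the ATG). The frame offset of 1 is an assumption inferred from the printed duplex, and the helper names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, frame=0):
    """Translate a DNA string (5'->3') in the given frame; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# Oligonucleotide 1798-2 as listed above.
oligo_1798_2 = (
    "TAT GAA AGG TGG AGG TGG TGG TGG AGG TAC TTA CTC TTG "
    "CCA CTT CGG CCC GCT GAC TTG G"
)

print(translate(oligo_1798_2, frame=1))
```

The translated string can be compared with the amino acid line printed under the first row of the duplex.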

This duplex was amplified in a PCR reaction using

1798-18 GCA GAA GAG CCT CTC CCT GTC TCC GGG TAA
        AGG TGG AGG TGG TGG TGG AGG TAC TTA
        CTC T

and

1798-19 CTA ATT GGA TCC ACG AGA TTA ACC ACC
        CTG CGG TTT GCA A

as the sense and antisense primers (SEQ ID NOS: 396 and 397,
respectively).

The Fc portion of the molecule was generated in a PCR reaction
with pFc-A3 using the primers

1216-52 AAC ATA AGT ACC TGT AGG ATC G

1798-17 AGA GTA AGT ACC TCC ACC ACC ACC TCC ACC TTT ACC CGG
        AGA CAG GGA GAG GCT CTT CTG C

which are SEQ ID NOS: 398 and 399, respectively. The oligonucleotides
1798-17 and 1798-18 contain an overlap of 61 nucleotides, allowing the two
genes to be fused together in the correct reading frame by combining the
above PCR products in a third reaction using the outside primers, 1216-52
and 1798-19.
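
The stated 61-nucleotide overlap between 1798-17 and 1798-18 can be checked by taking the reverse complement of one primer and comparing it with the other. This Python sketch is an illustration only; the function name is not from the source.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string written 5'->3'."""
    return dna.replace(" ", "").upper().translate(COMPLEMENT)[::-1]


# Primers as listed above.
primer_1798_17 = (
    "AGA GTA AGT ACC TCC ACC ACC ACC TCC ACC TTT ACC CGG "
    "AGA CAG GGA GAG GCT CTT CTG C"
).replace(" ", "")
primer_1798_18 = (
    "GCA GAA GAG CCT CTC CCT GTC TCC GGG TAA "
    "AGG TGG AGG TGG TGG TGG AGG TAC TTA CTC T"
).replace(" ", "")

# If the two primers are exact reverse complements, the overlap is their full length.
print(len(primer_1798_17), reverse_complement(primer_1798_17) == primer_1798_18)
```

Full complementarity over both primers is what lets the two first-round PCR products prime each other in the third, fusion reaction.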

The final PCR gene product (the full length fusion gene) was
digested with restriction endonucleases XbaI and BamHI, and then ligated
into the vector pAMG21 (described below), also digested with XbaI and
BamHI. Ligated DNA was transformed into competent host cells of E. coli
strain 2596 (GM221, described herein). Clones were screened for the ability
to produce the recombinant protein product and to possess the gene
fusion having the correct nucleotide sequence. A single such clone was
selected and designated Amgen strain #3718.

The nucleotide and amino acid sequence of the resulting fusion
protein (SEQ ID NOS: 15 and 16) are shown in Figure 13.
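
For the XbaI/BamHI cloning step described above, it can be useful to confirm where restriction sites sit in a primer. The Python sketch below scans a sequence for the well-known recognition sequences of XbaI (TCTAGA), BamHI (GGATCC) and NdeI (CATATG) and is applied to the antisense primer 1798-19. It is a plain string search offered as an illustration, and it reports 0-based positions on the strand as written.

```python
RECOGNITION_SITES = {"XbaI": "TCTAGA", "BamHI": "GGATCC", "NdeI": "CATATG"}


def find_sites(sequence):
    """Map each enzyme to the 0-based start positions of its site in `sequence`."""
    seq = sequence.replace(" ", "").upper()
    hits = {}
    for enzyme, site in RECOGNITION_SITES.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits


# Antisense primer 1798-19 as listed in this example.
primer_1798_19 = "CTA ATT GGA TCC ACG AGA TTA ACC ACC CTG CGG TTT GCA A"
print(find_sites(primer_1798_19))
```

All three recognition sequences are palindromic, so searching one strand is sufficient for these enzymes.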

EMP-Fc. A DNA sequence coding for a monomer of the EPO-mimetic
peptide fused in-frame to the Fc region of human IgG1 was
constructed using standard PCR technology. Templates for PCR reactions
were the pFc-A3a vector and a synthetic gene encoding EPO monomer.
The synthetic gene for the monomer was constructed from the 4
overlapping oligonucleotides 1798-4 and 1798-5 (above) and 1798-6 and
1798-7 (SEQ ID NOS: 400 and 401, respectively) shown below:






1798-6 GGC CCG CTG ACC TGG GTA TGT AAG CCA CAA GGG GGT GGG
       GGA GGC GGG GGG TAA TCT CGA G

1798-7 GAT CCT CGA GAT TAC CCC CCG CCT CCC CCA CCC CCT TGT
       GGC TTA CAT AC

The 4 oligonucleotides were annealed to form the duplex encoding an
amino acid sequence (SEQ ID NOS: 402 and 403, respectively) shown
below:

GTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGC
1 + + + + + + 60
GTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCG
a VCKPQGGGGGGGGTYSCHFG

CCGCTGACCTGGGTATGTAAGCCACAAGGGGGTGGGGGAGGCGGGGGGTAATCTCGAG
61 + + + + + + 118
GGCGACTGGACCCATACATTCGGTGTTCCCCCACCCCCTCCGCCCCCCATTAGAGCTCCTAG
a PLTWVCKPQGGGGGGG*

This duplex was amplified in a PCR reaction using 

1798-21 TTA TTT CAT ATG AAA GGT GGT AAC TAT TCC TGT CAT TTT 

and 

1798-22 TGG ACA TGT GTG AGT TTT GTC CCC CCC GCC TCC CCC ACC 

CCC T 

as the sense and antisense primers (SEQ ID NOS: 404 and 405, 
respectively). 

The Fc portion of the molecule was generated in a PCR reaction
with pFc-A3 using the primers

1798-23 AGG GGG TGG GGG AGG CGG GGG GGA CAA AAC TCA CAC ATG
        TCC A

and

1200-54 GTT ATT GCT CAG CGG TGG CA

which are SEQ ID NOS: 406 and 407, respectively. The oligonucleotides
1798-22 and 1798-23 contain an overlap of 43 nucleotides, allowing the two
genes to be fused together in the correct reading frame by combining the
above PCR products in a third reaction using the outside primers, 1798-21
and 1200-54.

The final PCR gene product (the full length fusion gene) was
digested with restriction endonucleases XbaI and BamHI, and then ligated
into the vector pAMG21 and transformed into competent E. coli strain
2596 cells as described above. Clones were screened for the ability to
produce the recombinant protein product and to possess the gene fusion
having the correct nucleotide sequence. A single such clone was selected
and designated Amgen strain #3688.

The nucleotide and amino acid sequences (SEQ ID NOS: 17 and 18) 
of the resulting fusion protein are shown in Figure 14. 

EMP-EMP-Fc. A DNA sequence coding for a dimer of the EPO-mimetic
peptide fused in-frame to the Fc region of human IgG1 was
constructed using standard PCR technology. Templates for PCR reactions
were the EMP-Fc plasmid from strain #3688 above and a synthetic gene
encoding the EPO dimer. The synthetic gene for the dimer was
constructed from the 8 overlapping oligonucleotides (SEQ ID NOS: 408 to
415, respectively) shown below:



1869-23 TTT TTT ATC GAT TTG ATT CTA GAT TTG AGT TTT AAC TTT
        TAG AAG GAG GAA TAA AAT ATG

1869-48 TAA AAG TTA AAA CTC AAA TCT AGA ATC AAA TCG ATA AAA
        AA

1871-72 GGA GGT ACT TAC TCT TGC CAC TTC GGC CCG CTG ACT TGG
        GTT TGC AAA CCG

1871-73 AGT CAG CGG GCC GAA GTG GCA AGA GTA AGT ACC TCC CAT
        ATT TTA TTC CTC CTT C

1871-74 CAG GGT GGC GGC GGC GGC GGC GGT GGT ACC TAT TCC TGT
        CAT TTT GGC CCG CTG ACC TGG

1871-75 AAA ATG ACA GGA ATA GGT ACC ACC GCC GCC GCC GCC GCC
        ACC CTG CGG TTT GCA AAC CCA

1871-78 GTA TGT AAG CCA CAA GGG GGT GGG GGA GGC GGG GGG GAC
        AAA ACT CAC ACA TGT CCA

1871-79 AGT TTT GTC CCC CCC GCC TCC CCC ACC CCC TTG TGG CTT
        ACA TAC CCA GGT CAG CGG GCC



The 8 oligonucleotides were annealed to form the duplex encoding an
amino acid sequence (SEQ ID NOS: 416 and 417, respectively) shown
below:

TTTTTTATCGATTTGATTCTAGATTTGAGTTTTAACTTTTAGAAGGAGGAATAAAATATG
1 + + + + + + 60
AAAAAATAGCTAAACTAAGATCTAAACTCAAAATTGAAAATCTTCCTCCTTATTTTATAC
a M

GGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTGGGTTTGCAAACCGCAGGGTGGC
61 + + + + + + 120
CCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAACCCAAACGTTTGGCGTCCCACCG
a GGTYSCHFGPLTWVCKPQGG

GGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGCCCGCTGACCTGGGTATGTAAG
121 + + + + + + 180
CCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCGACTGGACCCATACATTC
a GGGGGGTYSCHFGPLTWVCK

CCACAAGGGGGTGGGGGAGGCGGGGGGGACAAAACTCACACATGTCCA
181 + + + + 228
GGTGTTCCCCCACCCCCTCCGCCCCCCCTGTTTTGA
a PQGGGGGGGDKTHTCP

This duplex was amplified in a PCR reaction using 1869-23 and
1871-79 (shown above) as the sense and antisense primers.

The Fc portion of the molecule was generated in a PCR reaction
with strain 3688 DNA using the primers 1798-23 and 1200-54 (shown
above).

The oligonucleotides 1871-79 and 1798-23 contain an overlap of 31
nucleotides, allowing the two genes to be fused together in the correct
reading frame by combining the above PCR products in a third reaction
using the outside primers, 1869-23 and 1200-54.

The final PCR gene product (the full length fusion gene) was
digested with restriction endonucleases XbaI and BamHI, and then ligated
into the vector pAMG21 and transformed into competent E. coli strain
2596 cells as described for Fc-EMP. Clones were screened for ability to
produce the recombinant protein product and possession of the gene
fusion having the correct nucleotide sequence. A single such clone was
selected and designated Amgen strain #3813.

The nucleotide and amino acid sequences (SEQ ID NOS: 19 and 20,
respectively) of the resulting fusion protein are shown in Figure 15. There
is a silent mutation at position 145 (A to G, shown in boldface) such that
the final construct has a different nucleotide sequence than the
oligonucleotide 1871-72 from which it was derived.

Fc-EMP-EMP. A DNA sequence coding for the Fc region of human
IgG1 fused in-frame to a dimer of the EPO-mimetic peptide was
constructed using standard PCR technology. Templates for PCR reactions
were the plasmids from strains 3688 and 3813 above.

The Fc portion of the molecule was generated in a PCR reaction
with strain 3688 DNA using the primers 1216-52 and 1798-17 (shown
above). The EMP dimer portion of the molecule was the product of a
second PCR reaction with strain 3813 DNA using the primers 1798-18 (also
shown above) and SEQ ID NO: 418, shown below:

1798-20 CTA ATT GGA TCC TCG AGA TTA ACC CCC TTG TGG CTT ACA T

The oligonucleotides 1798-17 and 1798-18 contain an overlap of 61
nucleotides, allowing the two genes to be fused together in the correct
reading frame by combining the above PCR products in a third reaction
using the outside primers, 1216-52 and 1798-20.

The final PCR gene product (the full length fusion gene) was
digested with restriction endonucleases XbaI and BamHI, and then ligated
into the vector pAMG21 and transformed into competent E. coli strain
2596 cells as described for Fc-EMP. Clones were screened for the ability to
produce the recombinant protein product and to possess the gene fusion
having the correct nucleotide sequence. A single such clone was selected
and designated Amgen strain #3822.

The nucleotide and amino acid sequences (SEQ ID NOS: _ and _,
respectively) of the fusion protein are shown in Figure 16.

Characterization of Fc-EMP activity. Characterization was carried
out in vivo as follows.

Mice: Normal female BDF1 approximately 10-12 weeks of age.

Bleed schedule: Ten mice per group treated on day 0, two groups
started 4 days apart for a total of 20 mice per group. Five mice were bled
at each time point; mice were bled a maximum of three times a week. Mice
were anesthetized with isoflurane and a total volume of 140-160 µl of
blood was obtained by puncture of the orbital sinus. Blood was counted
on a Technicon H1E blood analyzer running software for murine blood.
Parameters measured were WBC, RBC, HCT, HGB, PLT, NEUT, LYMPH.

Treatments: Mice were either injected subcutaneously for a bolus
treatment or implanted with 7-day micro-osmotic pumps for continuous
delivery. Subcutaneous injections were delivered in a volume of 0.2 ml.
Osmotic pumps were inserted into a subcutaneous incision made in the
skin between the scapulae of anesthetized mice. Compounds were diluted
in PBS with 0.1% BSA. All experiments included one control group,
labeled "carrier", that was treated with this diluent only. The
concentration of the test articles in the pumps was adjusted so that the
calibrated flow rate from the pumps gave the treatment levels indicated in
the graphs.

Experiments: Various Fc-conjugated EPO mimetic peptides (EMPs)
were delivered to mice as a single bolus injection at a dose of 100 µg/kg.
Fc-EMPs were delivered to mice in 7-day micro-osmotic pumps. The
pumps were not replaced at the end of 7 days. Mice were bled until day
51 when HGB and HCT returned to baseline levels.

Example 4
TNF-α inhibitors

Fc-TNF-α inhibitors. A DNA sequence coding for the Fc region of
human IgG1 fused in-frame to a monomer of the TNF-α inhibitory peptide
was constructed using standard PCR technology. The Fc and 5 glycine
linker portion of the molecule was generated in a PCR reaction with DNA
from the Fc-EMP fusion strain #3718 (see Example 3) using the sense
primer 1216-52 and the antisense primer 2295-89 (SEQ ID NOS: 1112 and
1113, respectively). The nucleotides encoding the TNF-α inhibitory
peptide were provided by the PCR primer 2295-89 shown below:



1216-52 AAC ATA AGT ACC TGT AGG ATC G

2295-89 CCG CGG ATC CAT TAG GGA CGG TGA CCC AGA GAG GTG TTT TTG TAG
        TGC GGC AGG AAG TCA CCA CCA CCT CCA CCT TTA CCC

The oligonucleotide 2295-89 overlaps the glycine linker and Fc portion of
the template by 22 nucleotides, with the PCR resulting in the two genes
being fused together in the correct reading frame.

The PCR gene product (the full length fusion gene) was digested
with restriction endonucleases NdeI and BamHI, and then ligated into the
vector pAMG21 and transformed into competent E. coli strain 2596 cells as
described for EMP-Fc herein. Clones were screened for the ability to
produce the recombinant protein product and to possess the gene fusion
having the correct nucleotide sequence. A single such clone was selected
and designated Amgen strain #4544.

The nucleotide and amino acid sequences (SEQ ID NOS: 1055 and 
1056) of the fusion protein are shown in Figures 19A and 19B. 

TNF-α inhibitor-Fc. A DNA sequence coding for a TNF-α inhibitory
peptide fused in-frame to the Fc region of human IgG1 was constructed
using standard PCR technology. The template for the PCR reaction was a
plasmid containing an unrelated peptide fused via a five glycine linker to
Fc. The nucleotides encoding the TNF-α inhibitory peptide were
provided by the sense PCR primer 2295-88, with primer 1200-54 serving as
the antisense primer (SEQ ID NOS: 1117 and 407, respectively). The
primer sequences are shown below:

SSSSSSSSSSS"— — ~ 

1200-54 GTT ATT GCT CAG CGG TGG CA 

The oligonucleotide 2295-88 overlaps the glycine linker and Fc portion of
the template by 24 nucleotides, with the PCR resulting in the two genes
being fused together in the correct reading frame.

The PCR gene product (the full length fusion gene) was digested
with restriction endonucleases NdeI and BamHI, and then ligated into the
vector pAMG21 and transformed into competent E. coli strain 2596 cells as
described for EMP-Fc herein. Clones were screened for the ability to
produce the recombinant protein product and to possess the gene fusion
having the correct nucleotide sequence. A single such clone was selected
and designated Amgen strain #4543.

The nucleotide and amino acid sequences (SEQ ID NOS: 1057 and 1058) of
the fusion protein are shown in Figures 20A and 20B.

Expression in E. coli. Cultures of each of the pAMG21-Fc-fusion
constructs in E. coli GM221 were grown at 37 °C in Luria Broth medium
containing 50 µg/ml kanamycin. Induction of gene product expression
from the luxPR promoter was achieved following the addition of the
synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to the
culture media to a final concentration of 20 ng/ml. Cultures were
incubated at 37 °C for a further 3 hours. After 3 hours, the bacterial
cultures were examined by microscopy for the presence of inclusion
bodies and were then collected by centrifugation. Refractile inclusion
bodies were observed in induced cultures, indicating that the Fc-fusions
were most likely produced in the insoluble fraction in E. coli. Cell pellets
were lysed directly by resuspension in Laemmli sample buffer containing
10% β-mercaptoethanol and were analyzed by SDS-PAGE. In each case, an
intense Coomassie-stained band of the appropriate molecular weight was
observed on an SDS-PAGE gel.

Purification of Fc-peptide fusion proteins. Cells are broken in water
(1/10) by high pressure homogenization (2 passes at 14,000 PSI) and
inclusion bodies are harvested by centrifugation (4200 RPM in J-6B for 1
hour). Inclusion bodies are solubilized in 6M guanidine, 50mM Tris, 8mM
DTT, pH 8.7 for 1 hour at a 1/10 ratio. The solubilized mixture is diluted
20 times into 2M urea, 50mM Tris, 160mM arginine, 3mM cysteine, pH 8.5.
The mixture is stirred overnight in the cold and then concentrated about
10 fold by ultrafiltration. It is then diluted 3 fold with 10mM Tris, 1.5M
urea, pH 9. The pH of this mixture is then adjusted to pH 5 with acetic
acid. The precipitate is removed by centrifugation and the supernatant is
loaded onto a SP-Sepharose Fast Flow column equilibrated in 20mM
NaAc, 100mM NaCl, pH 5 (10mg/ml protein load, room temperature).
The protein is eluted from the column using a 20 column volume gradient
in the same buffer ranging from 100mM NaCl to 500mM NaCl. The pool
from the column is diluted 3 fold and loaded onto a SP-Sepharose HP
column in 20mM NaAc, 150mM NaCl, pH 5 (10mg/ml protein load, room
temperature). The protein is eluted using a 20 column volume gradient in
the same buffer ranging from 150mM NaCl to 400mM NaCl. The peak is
pooled and filtered.
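
The two salt gradients above can be pictured with a small calculation. The text specifies only the end points and the length (20 column volumes); the Python sketch below assumes the gradient is linear, which is an assumption and not stated in the text, and returns the NaCl concentration at a given point in the gradient.

```python
def nacl_mm_at(column_volumes, start_mm, end_mm, total_cv=20.0):
    """NaCl concentration (mM) after `column_volumes` of an assumed linear gradient."""
    if not 0.0 <= column_volumes <= total_cv:
        raise ValueError("position must lie within the gradient")
    return start_mm + (end_mm - start_mm) * column_volumes / total_cv


# SP-Sepharose Fast Flow step: 100 mM to 500 mM NaCl over 20 column volumes.
print(nacl_mm_at(10.0, 100.0, 500.0))
# SP-Sepharose HP step: 150 mM to 400 mM NaCl over 20 column volumes.
print(nacl_mm_at(10.0, 150.0, 400.0))
```

Under this assumption the first gradient rises by 20 mM per column volume and the second by 12.5 mM per column volume, so the second step is the shallower, higher-resolution one.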

Characterization of activity of Fc-TNF-α inhibitor and TNF-α
inhibitor-Fc. Binding of these peptide fusion proteins to TNF-α can be
characterized by BIAcore by methods available to one of ordinary skill in
the art who is armed with the teachings of the present specification.

Example 5
IL-1 Antagonist

Fc-IL-1 antagonist. A DNA sequence coding for the Fc region of
human IgG1 fused in-frame to a monomer of an IL-1 antagonist peptide
was constructed using standard PCR technology. The Fc and 5 glycine
linker portion of the molecule was generated in a PCR reaction with DNA
from the Fc-EMP fusion strain #3718 (see Example 3) using the sense
primer 1216-52 and the antisense primer 2269-70 (SEQ ID NOS: 1112 and
1118, respectively). The nucleotides encoding the IL-1 antagonist peptide
were provided by the PCR primer 2269-70 shown below:



1216-52 AAC ATA AGT ACC TGT AGG ATC G

2269-70 CCG CGG ATC CAT TAC AGC GGC AGA GCG TAC GGC TGC CAG TAA CCC
        GGG GTC CAT TCG AAA CCA CCA CCT CCA CCT TTA CCC



The oligonucleotide 2269-70 overlaps the glycine linker and Fc portion of
the template by 22 nucleotides, with the PCR resulting in the two genes
being fused together in the correct reading frame.

The PCR gene product (the full length fusion gene) was digested
with restriction endonucleases NdeI and BamHI, and then ligated into the
vector pAMG21 and transformed into competent E. coli strain 2596 cells as
described for EMP-Fc herein. Clones were screened for the ability to
produce the recombinant protein product and to possess the gene fusion
having the correct nucleotide sequence. A single such clone was selected
and designated Amgen strain #4506.

The nucleotide and amino acid sequences (SEQ ID NOS: 1059 and 
1060) of the fusion protein are shown in Figures 21A and 21B. 

IL-1 antagonist-Fc. A DNA sequence coding for an IL-1 antagonist
peptide fused in-frame to the Fc region of human IgG1 was constructed
using standard PCR technology. The template for the PCR reaction was a
plasmid containing an unrelated peptide fused via a five glycine linker to
Fc. The nucleotides encoding the IL-1 antagonist peptide were provided
by the sense PCR primer 2269-69, with primer 1200-54 serving as the
antisense primer (SEQ ID NOS: 1119 and 407, respectively). The primer
sequences are shown below:



2269-69 GAA TAA CAT ATG TTC GAA TGG ACC CCG GGT TAC TGG CAG CCG TAC GCT
        CTG CCG CTG GGT GGA GGC GGT GGG GAC AAA ACT

1200-54 GTT ATT GCT CAG CGG TGG CA
The oligonucleotide 2269-69 overlaps the glycine linker and Fc portion of
the template by 24 nucleotides, with the PCR resulting in the two genes
being fused together in the correct reading frame.

The PCR gene product (the full length fusion gene) was digested
with restriction endonucleases NdeI and BamHI, and then ligated into the
vector pAMG21 and transformed into competent E. coli strain 2596 cells as
described for EMP-Fc herein. Clones were screened for the ability to
produce the recombinant protein product and to possess the gene fusion
having the correct nucleotide sequence. A single such clone was selected
and designated Amgen strain #4505.

The nucleotide and amino acid sequences (SEQ ID NOS: 1061 and 
1062) of the fusion protein are shown in Figures 22A and 22B. Expression 
and purification were carried out as in previous examples. 

Characterization of Fc-IL-1 antagonist peptide and IL-1 antagonist
peptide-Fc activity. IL-1 receptor binding competition between IL-1β,
IL-1RA and Fc-conjugated IL-1 peptide sequences was carried out using the
IGEN system. Reactions contained 0.4 nM biotin-IL-1R + 15 nM IL-1-TAG
+ 3 µM competitor + 20 µg/ml streptavidin-conjugate beads, where
competitors were IL-1RA, Fc-IL-1 antagonist, and IL-1 antagonist-Fc.
Competition was assayed over a range of competitor concentrations from
3 µM to 1.5 pM. The results are shown in Table C below:



Table C: Results from IL-1 Receptor Binding Competition Assay

                            IL-1pep-Fc        Fc-IL-1pep        IL-1ra

Ki                          281.5             59.58             1.405
EC50                        530.0             112.2             2.645

95% Confidence Intervals
EC50                        280.2 to 1002     54.75 to 229.8    1.149 to 6.086
Ki                          148.9 to 532.5    29.08 to 122.1    0.6106 to 3.233

Goodness of Fit
R²                          0.9790            0.9687            0.9602
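
The Ki and EC50 values in Table C differ by a nearly constant factor for all three competitors. That would be expected if Ki was derived from EC50 with a Cheng-Prusoff-type correction, Ki = EC50 / (1 + [L]/Kd), but the text does not say how Ki was calculated, so this is an assumption. The Python sketch below only checks the ratio and the relative potencies from the tabulated numbers; the variable and function names are illustrative.

```python
# Values copied from Table C. The source does not state the concentration units.
table_c = {
    "IL-1pep-Fc": {"Ki": 281.5, "EC50": 530.0},
    "Fc-IL-1pep": {"Ki": 59.58, "EC50": 112.2},
    "IL-1ra": {"Ki": 1.405, "EC50": 2.645},
}


def ec50_to_ki_ratio(entry):
    """Return EC50 / Ki for one competitor."""
    return entry["EC50"] / entry["Ki"]


# A constant ratio across competitors is consistent with a single
# correction factor (1 + [L]/Kd) having been applied (assumption).
ratios = {name: ec50_to_ki_ratio(v) for name, v in table_c.items()}

# Potency relative to IL-1ra, computed from Ki (illustrative summary only).
fold_vs_il1ra = {
    name: v["Ki"] / table_c["IL-1ra"]["Ki"] for name, v in table_c.items()
}

for name in table_c:
    print(f"{name}: EC50/Ki = {ratios[name]:.3f}, "
          f"Ki relative to IL-1ra = {fold_vs_il1ra[name]:.1f}x")
```

On these numbers the Ki of Fc-IL-1pep is roughly 42-fold higher than that of IL-1ra, and the Ki of IL-1pep-Fc roughly 200-fold higher.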
Example 6
VEGF Antagonists

Fc-VEGF Antagonist. A DNA sequence coding for the Fc region of
human IgG1 fused in-frame to a monomer of the VEGF mimetic peptide
was constructed using standard PCR technology. The templates for the
PCR reaction were the pFc-A3 plasmid and a synthetic VEGF mimetic
peptide gene. The synthetic gene was assembled by annealing the
following two oligonucleotides (SEQ ID NOS: 1120 and 1121,
respectively):

2293-11 GTT GAA CCG AAC TGT GAC ATC CAT GTT ATG TGG GAA TGG GAA
        TGT TTT GAA CGT CTG

2293-12 CAG ACG TTC AAA ACA TTC CCA TTC CCA CAT AAC ATG GAT GTC
        ACA GTT CGG TTC AAC

The two oligonucleotides anneal to form the following duplex, which encodes
the amino acid sequence shown below (SEQ ID NO: 1122):

  GTTGAACCGAACTGTGACATCCATGTTATGTGGGAATGGGAATGTTTTGAACGTCTG
1 ---------+---------+---------+---------+---------+------- 57
  CAACTTGGCTTGACACTGTAGGTACAATACACCCTTACCCTTACAAAACTTGCAGAC
   V  E  P  N  C  D  I  H  V  M  W  E  W  E  C  F  E  R  L
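
As a consistency check on the synthetic gene, the following Python sketch confirms that oligonucleotide 2293-12 is the reverse complement of 2293-11 and translates the sense strand with the standard genetic code. The helper names (`revcomp`, `translate`) are illustrative and are not part of the source.

```python
# Oligonucleotide sequences copied from the text; spaces are stripped.
OLIGO_2293_11 = (
    "GTT GAA CCG AAC TGT GAC ATC CAT GTT ATG TGG GAA TGG GAA "
    "TGT TTT GAA CGT CTG"
).replace(" ", "")
OLIGO_2293_12 = (
    "CAG ACG TTC AAA ACA TTC CCA TTC CCA CAT AAC ATG GAT GTC "
    "ACA GTT CGG TTC AAC"
).replace(" ", "")

# Standard genetic code, built by enumerating codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def revcomp(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


is_duplex = revcomp(OLIGO_2293_11) == OLIGO_2293_12
peptide = translate(OLIGO_2293_11)
print(is_duplex, len(OLIGO_2293_11), peptide)
```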

This duplex was amplified in a PCR reaction using 2293-05 and 2293-06 as
the sense and antisense primers (SEQ ID NOS: 1125 and 1126).

The Fc portion of the molecule was generated in a PCR reaction
with the pFc-A3 plasmid using the primers 2293-03 and 2293-04 as the
sense and antisense primers (SEQ ID NOS: 1123 and 1124, respectively).
The full length fusion gene was obtained from a third PCR reaction using
the outside primers 2293-03 and 2293-06. These primers are shown below:

2293-03 ATT TGA TTC TAG AAG GAG GAA TAA CAT ATG GAC AAA ACT CAC
        ACA TGT

2293-04 GTC ACA GTT CGG TTC AAC ACC ACC ACC ACC ACC TTT ACC CGG
        AGA CAG GGA

2293-05 TCC CTG TCT CCG GGT AAA GGT GGT GGT GGT GGT GTT GAA CCG
        AAC TGT GAC ATC

2293-06 CCG CGG ATC CTC GAG TTA CAG ACG TTC AAA ACA TTC CCA
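
The overlap between the two first-round PCR products can be checked directly from the primer sequences. The Python sketch below (helper names are illustrative, not from the source) shows that antisense primer 2293-04 is the reverse complement of the first 51 nucleotides of sense primer 2293-05, and that the junction reads in frame as the Fc C-terminus, a five-glycine linker, and the start of the VEGF mimetic peptide.

```python
# Junction primers copied from the text; spaces are stripped.
P_2293_04 = (
    "GTC ACA GTT CGG TTC AAC ACC ACC ACC ACC ACC TTT ACC CGG AGA CAG GGA"
).replace(" ", "")
P_2293_05 = (
    "TCC CTG TCT CCG GGT AAA GGT GGT GGT GGT GGT GTT GAA CCG "
    "AAC TGT GAC ATC"
).replace(" ", "")

# Standard genetic code, built by enumerating codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def revcomp(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Read the antisense primer on the coding strand and compare with the sense primer.
overlap = revcomp(P_2293_04)
shares_overlap = P_2293_05.startswith(overlap)
junction = translate(P_2293_05)
print(shares_overlap, len(overlap), junction)
```

Because the two primers are complementary over their shared length, the Fc product and the peptide product can anneal to each other in the third PCR reaction that uses only the outside primers.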



The PCR gene product (the full length fusion gene) was digested
with restriction endonucleases NdeI and BamHI, and then ligated into the
vector pAMG21 and transformed into competent E. coli strain 2596 cells as
described for EMP-Fc herein. Clones were screened for the ability to
produce the recombinant protein product and to possess the gene fusion
having the correct nucleotide sequence. A single such clone was selected
and designated Amgen strain #4523.

The nucleotide and amino acid sequences (SEQ ID NOS: 1063 and
1064) of the fusion protein are shown in Figures 23A and 23B.

VEGF antagonist-Fc. A DNA sequence coding for a VEGF mimetic
peptide fused in-frame to the Fc region of human IgG1 was constructed
using standard PCR technology. The templates for the PCR reaction were
the pFc-A3 plasmid and the synthetic VEGF mimetic peptide gene
described above. The synthetic duplex was amplified in a PCR reaction
using 2293-07 and 2293-08 as the sense and antisense primers (SEQ ID
NOS: 1127 and 1128, respectively).

The Fc portion of the molecule was generated in a PCR reaction
with the pFc-A3 plasmid using the primers 2293-09 and 2293-10 as the
sense and antisense primers (SEQ ID NOS: 1129 and 1130, respectively).
The full length fusion gene was obtained from a third PCR reaction using
the outside primers 2293-07 and 2293-10. These primers are shown below:

2293-07 ATT TGA TTC TAG AAG GAG GAA TAA CAT ATG GTT GAA CCG AAC
        TGT GAC

2293-08 ACA TGT GTG AGT TTT GTC ACC ACC ACC ACC ACC CAG ACG TTC
        AAA ACA TTC

2293-09 GAA TGT TTT GAA CGT CTG GGT GGT GGT GGT GGT GAC AAA ACT
        CAC ACA TGT

2293-10 CCG CGG ATC CTC GAG TTA TTT ACC CGG AGA CAG GGA GAG
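
The outside primers carry the cloning sites and define the ends of the open reading frame. The Python sketch below locates the NdeI and BamHI recognition sequences and translates the coding ends. It assumes, as the figure sequences suggest but the text does not state, that the ATG within the NdeI site (CATATG) is the start codon; helper names are illustrative.

```python
# Outside primers copied from the text; spaces are stripped.
SENSE_2293_07 = (
    "ATT TGA TTC TAG AAG GAG GAA TAA CAT ATG GTT GAA CCG AAC TGT GAC"
).replace(" ", "")
ANTISENSE_2293_10 = (
    "CCG CGG ATC CTC GAG TTA TTT ACC CGG AGA CAG GGA GAG"
).replace(" ", "")

NDEI_SITE = "CATATG"   # NdeI recognition sequence
BAMHI_SITE = "GGATCC"  # BamHI recognition sequence

# Standard genetic code, built by enumerating codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def revcomp(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Assumption: translation starts at the ATG inside the NdeI site.
start = SENSE_2293_07.index(NDEI_SITE) + 3
n_term = translate(SENSE_2293_07[start:])

# Read the antisense primer on the coding strand, up to the stop codon.
coding_end = revcomp(ANTISENSE_2293_10)
c_term = translate(coding_end).split("*")[0]
has_bamhi = BAMHI_SITE in coding_end

print(n_term, c_term, has_bamhi)
```

Under that assumption the construct begins with Met followed by the peptide residues VEPNCD and ends with the Fc C-terminal residues LSLSPGK, a stop codon, and the BamHI site.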

The PCR gene product (the full length fusion gene) was digested
with restriction endonucleases NdeI and BamHI, and then ligated into the
vector pAMG21 and transformed into competent E. coli strain 2596 cells as
described for EMP-Fc herein. Clones were screened for the ability to
produce the recombinant protein product and to possess the gene fusion
having the correct nucleotide sequence. A single such clone was selected
and designated Amgen strain #4524.

The nucleotide and amino acid sequences (SEQ ID NOS: 1065 and
1066) of the fusion protein are shown in Figures 24A and 24B. Expression
and purification were carried out as in previous examples.

Example 7

MMP Inhibitors

Fc-MMP inhibitor. A DNA sequence coding for the Fc region of
human IgG1 fused in-frame to a monomer of an MMP inhibitory peptide
was constructed using standard PCR technology. The Fc and 5 glycine
linker portion of the molecule was generated in a PCR reaction with DNA
from the Fc-TNF-α inhibitor fusion strain #4544 (see Example 4) using the
sense primer 1216-52 and the antisense primer 2308-67 (SEQ ID NOS: 1112
and 1131, respectively). The nucleotides encoding the MMP inhibitor
peptide were provided by the PCR primer 2308-67 shown below:

1216-52 AAC ATA AGT ACC TGT AGG ATC G

2308-67 CCG CGG ATC CAT TAG CAC AGG GTG AAA CCC CAG TGG GTG GTG
        CAA CCA CCA CCT CCA CCT TTA CCC

The oligonucleotide 2308-67 overlaps the glycine linker and Fc portion of
the template by 22 nucleotides, with the PCR resulting in the two genes
being fused together in the correct reading frame.
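
To see what the antisense primer 2308-67 contributes, it can be read on the coding strand. The Python sketch below takes the reverse complement, translates it, and measures the stretch that precedes the peptide codons. The reading frame is an assumption, chosen so that translation begins at the Gly-Lys codons that end the Fc sequence; helper names are illustrative.

```python
# Antisense primer copied from the text; spaces are stripped.
P_2308_67 = (
    "CCG CGG ATC CAT TAG CAC AGG GTG AAA CCC CAG TGG GTG GTG "
    "CAA CCA CCA CCT CCA CCT TTA CCC"
).replace(" ", "")

# Standard genetic code, built by enumerating codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def revcomp(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


coding = revcomp(P_2308_67)

# Assumed frame: skip one base so the first codons are GGT AAA (Gly-Lys).
FRAME = 1
protein = translate(coding[FRAME:]).split("*")[0]

# Nucleotides that precede the first peptide residue (the first Cys).
template_overlap = FRAME + 3 * protein.index("C")
peptide = protein[protein.index("C"):]

print(protein, template_overlap, coding[:template_overlap], peptide)
```

The 22 nucleotides found ahead of the peptide codons agree with the overlap stated in the text, and the remaining codons encode the peptide CTTHWGFTLC followed by a stop codon.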

The PCR gene product (the full length fusion gene) was digested
with restriction endonucleases NdeI and BamHI, and then ligated into the
vector pAMG21 and transformed into competent E. coli strain 2596 cells as
described for EMP-Fc herein. Clones were screened for the ability to
produce the recombinant protein product and to possess the gene fusion
having the correct nucleotide sequence. A single such clone was selected
and designated Amgen strain #4597.

The nucleotide and amino acid sequences (SEQ ID NOS: 1067 and
1068) of the fusion protein are shown in Figures 25A and 25B. Expression
and purification were carried out as in previous examples.

MMP Inhibitor-Fc. A DNA sequence coding for an MMP inhibitory
peptide fused in-frame to the Fc region of human IgG1 was constructed
using standard PCR technology. The Fc and 5 glycine linker portion of the
molecule was generated in a PCR reaction with DNA from the Fc-TNF-α
inhibitor fusion strain #4543 (see Example 4). The nucleotides encoding
the MMP inhibitory peptide were provided by the sense PCR primer
2308-66, with primer 1200-54 serving as the antisense primer (SEQ ID NOS:
1132 and 407, respectively). The primer sequences are shown below:

2308-66 GAA TAA CAT ATG TGC ACC ACC CAC TGG GGT TTC ACC CTG TGC
        GGT GGA GGC GGT GGG GAC AAA

1200-54 GTT ATT GCT CAG CGG TGG CA

The oligonucleotide 2308-66 overlaps the glycine linker and Fc portion of
the template by 24 nucleotides, with the PCR resulting in the two genes
being fused together in the correct reading frame.

The PCR gene product (the full length fusion gene) was digested
with restriction endonucleases NdeI and BamHI, and then ligated into the
vector pAMG21 and transformed into competent E. coli strain 2596 cells as
described for EMP-Fc herein. Clones were screened for the ability to
produce the recombinant protein product and to possess the gene fusion
having the correct nucleotide sequence. A single such clone was selected
and designated Amgen strain #4598.

The nucleotide and amino acid sequences (SEQ ID NOS: 1069 and
1070) of the fusion protein are shown in Figures 26A and 26B.

* * * 

The invention now being fully described, it will be apparent to one
of ordinary skill in the art that many changes and modifications can be
made thereto, without departing from the spirit and scope of the invention
as set forth herein.



Abbreviations

Abbreviations used throughout this specification are as defined
below, unless otherwise defined in specific circumstances.

Ac          acetyl (used to refer to acetylated residues)
AcBpa       acetylated p-benzoyl-L-phenylalanine
ADCC        antibody-dependent cellular cytotoxicity
Aib         aminoisobutyric acid
bA          beta-alanine
Bpa         p-benzoyl-L-phenylalanine
BrAc        bromoacetyl (BrCH2C(O))
BSA         Bovine serum albumin
Bzl         Benzyl
Cap         Caproic acid
CTL         Cytotoxic T lymphocytes
CTLA4       Cytotoxic T lymphocyte antigen 4
DARC        Duffy blood group antigen receptor
DCC         Dicyclohexylcarbodiimide
Dde         1-(4,4-dimethyl-2,6-dioxo-cyclohexylidene)ethyl
EMP         Erythropoietin-mimetic peptide
ESI-MS      Electron spray ionization mass spectrometry
EPO         Erythropoietin
Fmoc        fluorenylmethoxycarbonyl
G-CSF       Granulocyte colony stimulating factor
GH          Growth hormone
HCT         hematocrit
HGB         hemoglobin
hGH         Human growth hormone
HOBt        1-Hydroxybenzotriazole
HPLC        high performance liquid chromatography
IL          interleukin
IL-R        interleukin receptor
IL-1R       interleukin-1 receptor
IL-1ra      interleukin-1 receptor antagonist
Lau         Lauric acid
LPS         lipopolysaccharide
LYMPH       lymphocytes
MALDI-MS    Matrix-assisted laser desorption ionization mass spectrometry
Me          methyl
MeO         methoxy
MHC         major histocompatibility complex
MMP         matrix metalloproteinase
MMPI        matrix metalloproteinase inhibitor
1-Nap       1-naphthylalanine
NEUT        neutrophils
NGF         nerve growth factor
Nle         norleucine
NMP         N-methyl-2-pyrrolidinone
PAGE        polyacrylamide gel electrophoresis
PBS         Phosphate-buffered saline
Pbf         2,2,4,6,7-pentamethyldihydrobenzofuran-5-sulfonyl
PCR         polymerase chain reaction
Pec         pipecolic acid
PEG         Poly(ethylene glycol)
pGlu        pyroglutamic acid
Pic         picolinic acid
PLT         platelets
pY          phosphotyrosine
RBC         red blood cells
RBS         ribosome binding site
RT          room temperature (25 °C)
Sar         sarcosine
SDS         sodium dodecyl sulfate
STK         serine-threonine kinases
t-Boc       tert-Butoxycarbonyl
tBu         tert-Butyl
TGF         tissue growth factor
THF         thymic humoral factor
TK          tyrosine kinase
TMP         Thrombopoietin-mimetic peptide
TNF         Tissue necrosis factor
TPO         Thrombopoietin
TRAIL       TNF-related apoptosis-inducing ligand
Trt         trityl
UK          urokinase
UKR         urokinase receptor
VEGF        vascular endothelial cell growth factor
VIP         vasoactive intestinal peptide
WBC         white blood cells
What is claimed is:

1. A composition of matter of the formula

(X^1)_a-F^1-(X^2)_b

and multimers thereof, wherein:
F^1 is an Fc domain;
X^1 and X^2 are each independently selected from -(L^1)_c-P^1,
-(L^1)_c-P^1-(L^2)_d-P^2, -(L^1)_c-P^1-(L^2)_d-P^2-(L^3)_e-P^3, and
-(L^1)_c-P^1-(L^2)_d-P^2-(L^3)_e-P^3-(L^4)_f-P^4;
P^1, P^2, P^3, and P^4 are each independently sequences of
pharmacologically active peptides;
L^1, L^2, L^3, and L^4 are each independently linkers; and
a, b, c, d, e, and f are each independently 0 or 1, provided
that at least one of a and b is 1.

2. The composition of matter of Claim 1 of the formulae

X^1-F^1

or

F^1-X^2.

3. The composition of matter of Claim 1 of the formula

F^1-(L^1)_c-P^1.

4. The composition of matter of Claim 1 of the formula

F^1-(L^1)_c-P^1-(L^2)_d-P^2.

5. The composition of matter of Claim 1 wherein F^1 is an IgG Fc
domain.

6. The composition of matter of Claim 1 wherein F^1 is an IgG1 Fc
domain.

7. The composition of matter of Claim 1 wherein F^1 comprises the
sequence of SEQ ID NO: 2.

8. The composition of matter of Claim 1 wherein X^1 and X^2 comprise
an IL-1 antagonist peptide sequence.

9. The composition of matter of Claim 8 wherein the IL-1 antagonist
peptide sequence is selected from SEQ ID NOS: 212, 907, 908, 909,
910, 917, and 979.

10. The composition of matter of Claim 8 wherein the IL-1 antagonist
peptide sequence is selected from SEQ ID NOS: 213 to 271, 671 to
906, 911 to 916, and 918 to 1023.

11. The composition of matter of Claim 8 wherein F^1 comprises the
sequence of SEQ ID NO: 2.

12. The composition of matter of Claim 1 wherein X^1 and X^2 comprise
an EPO-mimetic peptide sequence.

13. The composition of matter of Claim 12 wherein the EPO-mimetic
peptide sequence is selected from Table 5.

14. The composition of matter of Claim 12 wherein F^1 comprises the
sequence of SEQ ID NO: 2.

15. The composition of matter of Claim 12 comprising a sequence
selected from SEQ ID NOS: 83, 84, 85, 124, 419, 420, 421, and 461.

16. The composition of matter of Claim 12 comprising a sequence
selected from SEQ ID NOS: 339 and 340.

17. The composition of matter of Claim 12 comprising a sequence
selected from SEQ ID NOS: 20 and 22.

18. The composition of matter of Claim 3 wherein P^1 is a TPO-mimetic
peptide sequence.

19. The composition of matter of Claim 18 wherein P^1 is a TPO-mimetic
peptide sequence selected from Table 6.

20. The composition of matter of Claim 18 wherein F^1 comprises the
sequence of SEQ ID NO: 2.

21. The composition of matter of Claim 18 having a sequence selected
from SEQ ID NOS: 6 and 12.

22. A DNA encoding a composition of matter of any of Claims 1 to 21.

23. An expression vector comprising the DNA of Claim 22.

24. A host cell comprising the expression vector of Claim 23.

25. The cell of Claim 24, wherein the cell is an E. coli cell.

26. A process for preparing a pharmacologically active compound,
which comprises
a) selecting at least one randomized peptide that modulates the
activity of a protein of interest; and
b) preparing a pharmacologic agent comprising at least one Fc
domain covalently linked to at least one amino acid sequence
of the selected peptide or peptides.

27. The process of Claim 26, wherein the peptide is selected in a process
comprising screening of a phage display library, an E. coli display
library, a ribosomal library, or a chemical peptide library.

28. The process of Claim 26, wherein the preparation of the
pharmacologic agent is carried out by:
a) preparing a gene construct comprising a nucleic acid
sequence encoding the selected peptide and a nucleic acid
sequence encoding an Fc domain; and
b) expressing the gene construct.

29. The process of Claim 26, wherein the gene construct is expressed in
an E. coli cell.

30. The process of Claim 26, wherein the protein of interest is a cell
surface receptor.

31. The process of Claim 26, wherein the protein of interest has a linear
epitope.

32. The process of Claim 26, wherein the protein of interest is a
cytokine receptor.

33. The process of Claim 26, wherein the peptide is an EPO-mimetic
peptide.

34. The process of Claim 26, wherein the peptide is a TPO-mimetic
peptide.

35. The process of Claim 26, wherein the peptide is an IL-1 antagonist
peptide.

36. The process of Claim 26, wherein the peptide is an MMP inhibitor
peptide or a VEGF antagonist peptide.

37. The process of Claim 26, wherein the peptide is a TNF-antagonist
peptide.

38. The process of Claim 26, wherein the peptide is a CTLA4-mimetic
peptide.

39. The process of Claim 26, wherein the peptide is selected from
Tables 4 to 20.

40. The process of Claim 26, wherein the selection of the peptide is
carried out by a process comprising:
a) preparing a gene construct comprising a nucleic acid
sequence encoding a first selected peptide and a nucleic acid
sequence encoding an Fc domain;
b) conducting a polymerase chain reaction using the gene
construct and mutagenic primers, wherein
i) a first mutagenic primer comprises a nucleic acid
sequence complementary to a sequence at or near the
5' end of a coding strand of the gene construct, and
ii) a second mutagenic primer comprises a nucleic acid
sequence complementary to the 3' end of the
noncoding strand of the gene construct.

41. The process of Claim 26, wherein the compound is derivatized.

42. The process of Claim 26, wherein the derivatized compound
comprises a cyclic portion, a cross-linking site, a non-peptidyl
linkage, an N-terminal replacement, a C-terminal replacement, or a
modified amino acid moiety.

43. The process of Claim 26 wherein the Fc domain is an IgG Fc
domain.

44. The process of Claim 26, wherein the vehicle is an IgG1 Fc domain.

45. The process of Claim 26, wherein the vehicle comprises the
sequence of SEQ ID NO: 2.

46. The process of Claim 26, wherein the compound prepared is of the
formula

(X^1)_a-F^1-(X^2)_b

and multimers thereof, wherein:
F^1 is an Fc domain;
X^1 and X^2 are each independently selected from -(L^1)_c-P^1,
-(L^1)_c-P^1-(L^2)_d-P^2, -(L^1)_c-P^1-(L^2)_d-P^2-(L^3)_e-P^3, and
-(L^1)_c-P^1-(L^2)_d-P^2-(L^3)_e-P^3-(L^4)_f-P^4;
P^1, P^2, P^3, and P^4 are each independently sequences of
pharmacologically active peptides;
L^1, L^2, L^3, and L^4 are each independently linkers; and
a, b, c, d, e, and f are each independently 0 or 1, provided
that at least one of a and b is 1.

47. The process of Claim 46, wherein the compound prepared is of the
formulae

X^1-F^1

or

F^1-X^2.

48. The process of Claim 46, wherein the compound prepared is of the
formulae

F^1-(L^1)_c-P^1

or

F^1-(L^1)_c-P^1-(L^2)_d-P^2.

49. The process of Claim 46, wherein F^1 is an IgG Fc domain.

50. The process of Claim 46, wherein F^1 is an IgG1 Fc domain.

51. The process of Claim 46, wherein F^1 comprises the sequence of SEQ
ID NO: 2.

FIG. 2A FIG. 2B FIG. 2C

FIG. 4

ATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCA
1 + + + + + + 60
TACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGT
MDKTHTCPPCPAPELLGGPS

GTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTC
61 + + + + + + 120
CAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAG
VFLFPPKPKDTLMISRTPEV

ACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTG
121 + + + + + + 180
TGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCAC
TCVVVDVSHEDPEVKFNWYV

GACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACG
181 + + + + + + 240
CTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGC
DGVEVHNAKTKPREEQYNST

TACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTAC
241 + + + + + + 300
ATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATG
YRVVSVLTVLHQDWLNGKEY

AAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCC
301 + + + + + + 360
TTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGG
KCKVSNKALPAPIEKTISKA

AAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACC
361 + + + + + + 420
TTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGG
KGQPREPQVYTLPPSRDELT

AAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTG
421 + + + + + + 480
TTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCAC
KNQVSLTCLVKGFYPSDIAV

GAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGAC
481 + + + + + + 540
CTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTG
EWESNGQPENNYKTTPPVLD

TCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAG
541 + + + + + + 600
AGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTC
SDGSFFLYSKLTVDKSRWQQ

GGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAG
601 + + + + + + 660
CCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTC
GNVFSCSVMHEALHNHYTQK

AGCCTCTCCCTGTCTCCGGGTAAA
661 + + 684
TCGGAGAGGGACAGAGGCCCATTT
SLSLSPGK

FIG. 5

FIG. 7

I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 + + + + + + 60 

AGATCTAMCAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHTC P * 

CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 + ♦ + + + + 120 

GTGGMCAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMISRTPEVTCVVVDV3- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 + + + + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HED PEVKFNWYVDGVEVHNA- 

ccaagacaaagccgcgggaggagcagtacaacacx:acgtaccgtgtggtcagcgtcctca 

241 + * + + + + 300 

ggttctgtttcggcck:cctcctcgtcatgttgtcgtgcatggcacaccagtcgcaggagt 
ktk preeqynstyrvv svlt- 

ccgtcctgcaccaggactggctgaatggcaaggagtacaagtgcaaggtctccaacaaag 

301 ♦ + ♦ + + + 360 

ggcaggacgtggtcctgaccgacttaccgttcc^ 

VLHQDWLN GKEYKCKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC . 

361 + + + + + + 420 

GGGAGGGTC GGGGGT AGCTCTTTTGGT AGAGGTTTC GGTTTCCCGTCGGGGCTCTTGGTG 
LPAPIEKTI 3 K A K G Q PREPQ- 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 + + + ♦ ♦ + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTLPP3RDELTKNQVSLTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 + ♦ + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
LVKGFY PSDIAVE WESNGQ P - 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + + + ♦ + + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
ENNYKTTPPVLDSDGS PFLY- 

ACAGCMGCTCACCGTGGACMGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 + + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKSRWQQGNVFSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + + + + + 720 

ACTACGTACTCCGAGACGTGTTGCITGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQKSLSLS P G K • 

AAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTT 

721 + + + + ♦ + 780 

TTCCACCTCCACCACCATAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGAA 
GGGGGIEGPTLRQWLAARA*- 

BamHI
I 

AATCTCGAGGATCC 

781 + 794 

TTAGAGCTCCTAGG 

FIG. 8

I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 ♦ + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHTC P - 

CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 + + + + + + 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
PC PAPELLGGPSVFLPPPKP- 



CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 + ♦ + + ♦ --- + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMI SRTPBVTC VVVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 + ♦ + + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HEDPEVKFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 + ♦ + + + + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 + + ♦ + + + 360 

GGC AGG ACGTGGTC CTGACCGACTT ACCGTTCCTC ATGTTCACGTTCC AGAGGTTGTTTC 
VLHQDWLNGKEYKCKVSNKA* 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + + ♦ + «■ + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTC 

LPAPIEKTISKAKGQPREPQ- 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 ♦ + + ♦ + + 480 

TC C ACATGTGGG ACGGGGGTAGGGC C CT AC TC GACTGGTTCTTGGTCC AGTCGG ACTGG A 
VYTLPP3RDELTKNQVSLTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 - ♦ ♦ + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
LVKGFYPSDIAVEWESNGQP- 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + + + + + + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
ENNYKTTPPVLDSDGS FFLY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 + + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKSRWQQGNVFSCSV- 

55i T?^T??^T?t^T?T?^*^?^?T^i??? ^?T?Tr. • + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQKSLSLS PGK* 

AAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTG 

721 + ♦ ♦ - + + ♦ 780 

TTCCACCTCCACCACCATAGCrfCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGAC 
GGGGGI EGPTLRQWLAARAG- 

GTGGTGGAGGTGGCGGCGGAGGTATTGAGGGCCCAACCCTTCGCCAATGGCTTGCAGCAC 
781 + ♦ ♦ ♦ + ♦ 840 



CACCACCTCCACCGCCGCCTCCATAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGTG 
GGGGGGGI EG PTLRQWLAAR* 

BamHI 
I 

GCGCATAATCTCGAGGATCCG 

841 + + - 861 

CGCGTATTAGAGCTCCTAGGC 

c A * - 

XbaI

TCTAGATTTGTTTTAACTAATTAAAGGAGGMTMCATATGATCGAAGGTCCGACTCTGC 
1 ......«.» + »»»---«»- + * - «»*"***^*""**"*"'*^"*""*****^*"*"*""""^' ^ 0 

AGATCTAMCAAAATTGATTAATTTCCTCCTTATTGTATACTAGCTTCCAGGCTGAGACG 

MIEGPTLR- 

GTCAGTGGCTGGCTGCTCGTGCTGGCGGTGGTGGCGGAGGGGGTGGCATTGAGGGCCCAA 

gi + - + .-- + ♦ + 120 

CAGTCACCGACCGACGAGCACGACCGCCACCACCGCCTCCCCCACCGTAACTCCCGGGTT 
QWLAARAGGGGGGGGI EG P T • 

CCCTTCGCCAATGGCTTGCAGCACGCGCAGGGGGAGGCGGTGGGGACAAAACTCACACAT 

121 + + + + Zl2l + 180 

GGGAAGCGGTTACCGAACGTCGTGCGCGTCCCCCTCCGCCACCCCTGTTTTGAGTGTGTA 

LRQWLAARAGGGGGDKTHTC- 

gtccaccttgcccaotacctgaactcc 

caggtggaacgggtcgtggacttgaggacccccctggcagtcaaaaggagaaggggggtt 

PPCPAPELLGGPSVPLFPPK- 

AACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACG 
241 + + + ♦ + " + 

ttgcx;ttcctgt«x^ctactagagggcctgg<wactccagtctacgcacc^^^ 

P K DTLMISRTPEVTCVVVDV- 

tgac^cacgaagaccctc^ggtcaagttcmctggtacgtggacg^ 
actcgct^ttctgggactccagttcaagttgaccatgcacctgccgcacctccacgtat 

S HEDPEVKPNWYVDGVEVHN- 

ATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCC 

ui . + + +•...-.-••+ 420 

TACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACAC^GTCGCAGG 
AKTKPREEQYN3TYRVV SVL- 



TCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACA 

421 + + + + + + «80 

AGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGT 
TVLHQDWLNGKEYKC KV SNK* 

AAGCCCTCCCACXXCCCATCGAGAAAA^^^ _ 

401 + + • -*-* + **** *V" 

TTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTC^ 

ALPAPIEKTISKAK GQPREP- 

CACA(WTGTACACCCTCX:cgCCATCCCC^ATGACK:TGACCAA<^CCAGGTCAGCCTGA 

c 41 4. + + + + + 600 

GTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACT 
QVY TLP PSRDELTKMQVSLT* 

CCT<XrCT05TCAM(XK:TTCTATCCCAGCGACATCCK:COT ^ 

601 <^C<^CCAG^ 

C LVKGFY PS DIAVEWE 3NGQ* 

AGCCCX^GAAOWCTACM^ ^ 

661 TCGTCCTCTTOTGATGTC^ 

PENNYKTTPPVLDSDGS FFL* 

TCTACAGCAAGCTCACCGTGGACAAGAGCAGGTG^A^AGG^ ^ 



721 Xgatgtcg^gagtggcacctgttctcgtccaccgtcgtc 

v «rL*VDKSRWQQGNVFSCS 



781 



YSKLTVDKSRWQQ 

GGCACTACCTACTCCGA^GTGTT^ m 
VMHEALHNHYTQKSL3LSPG 



BamHI

I 

GTAAATAATGGATCC 

841 ♦ 855 

CATTTATTACCTAGG 

FIG. 10



TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGATCGAAGGTCCGACTCTGC 

1 + t + ♦ ♦ + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACTAGCTTCCAGGCTGAGACG 

MIEGPTLR- 

GTCAGTGGCTGGCTGCTCGTGCTGGTGGAGGCGGTGGGGACAAAACTCACACATGTCCAC 

61 + + + + + + 120 

CAGTCACCGACCGACGAGCACGACCACCTCCGCCACCCCTGTTTTGAGTGTGTACAGGTG 
QWLAARAGGGGGDKTHTCPP- 

CTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCA 

121 + + + + + + 180 

GAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGT 
CPAPELLGGPSVFLF PPKPK- 

AGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCC 

181 + + ♦ + + + 240 

TCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGG 
DTLMI SRTPEVTCVVVDVSH- 

ACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCA 

241 + + + + + + 300 

TGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGT 
EDPEVKFNWYVDGVEVHNAK- 

AGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCG 

301 + + + + + + 360 

TCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGC 
TKPREEQYNSTYRVVSVLTV- 

TCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCC 

361 + + + + + ♦ 420 

AGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGG 
LHQDWLNGKEYKCKVSNKAL- 

TCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGG 
421 + + + + + 480 

AGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCC 
PAPIEKTISKAKGQPREPQV- 

TGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCC 

481 + + + + + + 540 

ACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGG 
YTLPPSRDELTKNQVSLTCL- 

TGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGG 

541 + + + + + + 600 

ACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCC 
VKGPYPSDIA VEWESNGQPE- 

AGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACA 

601 + + + + + + 660 

TCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGT 
NNYKTTPPVLDSDGS P F L Y S - 

GCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGA 

661 + + + ♦ + + 720 

— CGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACT 

KLTVDKSRWQQGNVFSC~SVM- " 

TGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAAT 

721 + + + + ♦ + 780 

ACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTA 
HEALHNHYTQKSLSLS P G K * - 

BamHI 
I 

AATGGATCC 

781 789 

TTACCTAGG 

FIG. 13

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

l + + + + HI""* + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHTCP- 
CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 + + + + + + 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

12\ + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMI SRT.PEVTC VVVDVS- 

GC C AC G AAG ACC CTG AGGTC AAGTTC AACTGGT AC GTGGAC GGC GTGGAGGTGCAT AATG 

181 . + + ♦ + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HBD PEVKPNWYVDGVBVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 
24i + + + + + + 300 

ggttctgtttcggcgccctcctcgtcatgttgtcgtgcatggcacaccagtcgcaggagt 
ktkpreeqynstyrvvsvlt- 

ccgtcctgcaccaggactggctgaatgo:aaggagtacaagtgcaaggtctccaacaaag 

301 + + + + 360 

ggcaggacgtggtcctgaccgacttaccgttcctcatgttcacgttccagaggttgtttc 

VLHQDWLNGKEYKCKVSNKA- 

ccctcccagcccccatcgagaaaaccatctccaaagccaaagggcagccccgagaaccac 

361 * + ♦.......-.+ + + 420 

gggagggtcgggggtagctcttttggtagaggtttc^ 
lp^apiektiskakgqprepq- 

aggtgtacaccctgcccccatcccgggatgagctgaccaagaaccaggtcagcctgacct 

tccacatgtgggacgggggtacwgccctactcgactggttcttggtccagtcggactgga 

VYTLPPSRDELTKNQVSLTC- 

gcctggtcaaaggcttctatcccagcgacatcgccgtggagtgggagagcaatgggcagc 

cggaccagtttccgaagatagggtcgctgtagcggcacctcaccctctcgttacccgtcg 

lvkgpypsdiavewesngqp- 
cggagaacmctacmgaccacgcctcccgtgctggactccgacggctccttcttcctct 

541 + + + + + + 600 

gcctcttgttgatgttctggtgcggagggcacgacctgaggctgccgaggaagaaggaga 

ENNYKTTPPVLDSDGS PPLY- 

acagcaagctcaccgtggacaaga«:aggtggcack:aggggaacgtcct 

gQl + 4- + + + + 660 

tgtcgttcgagtggcacctgttctcgtccaccgtcgtccccttgcagaagagtacgaggc 

SKLTVDKSRWQQGNVPSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

rgl + + ••• + + + --• + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQKSLSL^S P G K • 

AAGGTGGAGGTGGTGGTGGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTGGGTTT 

721 + + + + + + 780 

TTCCACCTCCACCACCACCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAACCCAAA 
GGGGGGGTYSCH PGPLTWVC- 

BamHI

I 

GCAAACCGCAGGGTGGTTAATCTCGTGGATCC 

7 gi + + + -• 812 

CGTTTGGCGTCCCACCAATTAGAGCACCTAGG 
K P Q G G * 



WO 00/24782    PCT/US99/25044



XbaI



FIG. 14 



TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGGAGGTACTTACTCTTGCC 

1 + + ♦ + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCCTCCATGAATGAGAACGG 

MGGTYSCH- 

ACTTCGGCCCGCTGACTTGGGTATGTAAGCCACAAGGGGGTGGGGGAGGCGGGGGGGACA 

61 + + + + + + 120

TGAAGCCGGGCGACTGAACCCATACATTCGGTGTTCCCCCACCCCCTCCGCCCCCCCTGT
FGPLTWVCKPQGGGGGGGDK-

AAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCC 

121 + + + + + + 180

TTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGG
THTCPPCPAPELLGGPSVFL-

TCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCG 

181 + + + + + + 240

AGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGC
FPPKPKDTLMISRTPEVTCV-

TGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCG 

241 + + + + + + 300

ACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGC
VVDVSHEDPEVKFNWYVDGV-

TGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTG

301 + + + + + + 360

ACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCAC
EVHNAKTKPREEQYNSTYRV-

TGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCA 

361 + + + + + + 420

ACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGT
VSVLTVLHQDWLNGKEYKCK-

AGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGC 

421 + + + + + + 480

TCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCG
VSNKALPAPIEKTISKAKGQ-

AGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACC 

481 + + + + + + 540

TCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGG

PREPQVYTLPPSRDELTKNQ- 

AGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGG 

541 + + + + + + 600

TCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCC
VSLTCLVKGFYPSDIAVEWE-

AGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACG 

601 + + + + + + 660

TCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGC 
SNGQPENNYKTTPPVLDSDG- 

GCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACG 

661 + + + + + + 720

CGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGC
SFFLYSKLTVDKSRWQQGNV-

TCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCT 

721 + + + + + + 780

AGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGA 
FSCSVMHEALHNHYTQKSLS- 

BamHI 
I 

CCCTGTCTCCGGGTAAATAATGGATCC

781 + + 807

GGGACAGAGGCCCATTTATTACCTAGG
LSPGK*




FIG. 15 



XbaI

TCTAGATTTGAGTTTTAACTTTTAGAAGGAGGAATAAAATATGGGAGGTACTTACTCTTG

1 + + + + + + 60

AGATCTAAACTCAAAATTGAAAATCTTCCTCCTTATTTTATACCCTCCATGAATGAGAAC

MGGTYSC 

CCACTTCGGCCCACTGACTTGGGTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGG

61 + + + + + + 120

GGTGAAGCCGGGTGACTGAACCCAAACGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACC
HFGPLTWVCKPQGGGGGGGG-

TACCTATTCCTGTCATTTTGGCCCGCTGACCTGGGTATGTAAGCCACAAGGGGGTGGGGG 

121 ♦ + + * + * 180 

ATGGATAAGGACAGTAAAACCGGGCGACTGGACCCATACATTCGGTGTTCCCCCACCCCC 
TYSCHFGPLTWVCKPQGGGG 

AGGCGGGGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGG 

181 + + + + + + 240

TCCGCCCCCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCC 
GGGDKTHTCPPCPAPELLGG 

ACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCC

241 + + + + + + 300

TGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGG

PSVFLFPPKPKDTLMISRTP-

TGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTG 

301 + + + + + + 360

ACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGAC
EVTCVVVDVSHEDPEVKFNW-



GTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAA

361 + + + + + + 420

CATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTT

YVDGVEVHNAKTKPREEQYN-
cao:acgtaccgtgt(wtcagcgtcctcaccctcctgcaccaggac^ 

421 GTCGraA^ACACCAGTCGCAGGAG^ 

STYRVVSVLTVLHQDWLNGK-

GGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAA ^ 

EYKCKVSNKALPAPIEKTIS-

CJUUttfflUUKWJCMCC^^ ^ 

541 OTTTCGOTTTCMGTCGOGGCTCTTGGTCTTCA 

KAKGQPREPQVYTLPPSRDE-

GCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGC^ ^ 

601 CGACTGGTTOTCGTCCAGTCGGACTGGW 

LTKNQVSLTCLVKGFYPSDI-




CGCCGTGGAGT<^GAGCAATGGGCAGCCGGAGAACAACTA^ ^ 

661 GGGGCACCTCACCCTCTCG^ACCCGTCG^CTCTTC 

AVEWESNGQPENNYKTTPPV-

GCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTG

721 + + + + + + 780

CGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCAC



GCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATCAGaTCT ^ 

CCTCCTCCCc'toGCAGA^GT^^ _ 
QQGNVFSCSVMHEALHNHYT-



781 

I 

0 0 



BamHI 



GCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC

841 + + + + 881

CGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG
QKSLSLSPGK*
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FIG. 16 



XbaI
I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 + + + + + + 60

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG
MDKTHTCP-

CACCTTGCCCAGCACCTGAACTCCTCGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAAC 

61 + + + + + 120 

GTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGPSVFLFPPKP-

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMISRTPEVTCVVVDVS-

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG

181 + + + + + + 240

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC
HEDPEVKFNWYVDGVEVHNA-

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA

241 + + + + + + 300

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT
KTKPREEQYNSTYRVVSVLT-

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 + + + + + + 360

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC
VLHQDWLNGKEYKCKVSNKA-

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + + + + + + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTG 
LPAPIEKTISKAKGQPREPQ-

AGGTGTACACCCTGCCTCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 + + + + + + 480 

TCCACATGTGGGACGGAGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTLPPSRDELTKNQVSLTC-

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 + + + + + + 540

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG
LVKGFYPSDIAVEWESNGQP-

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + + + + + + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA
ENNYKTTPPVLDSDGSFFLY-

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 + + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKSRWQQGNVFSCSV-

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + + + + + + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQKSLSLSPGK-

AAGGTGGAGGTGGTGGCGGAGGTACTTACTCTTGCCACTTCGGCCCACTGACTTGGGTTT

721 + + + + + + 780

TTCCACCTCCACCACCGCCTCCATGAATGAGAACGGTGAAGCCGGGTGACTGAACCCAAA
GGGGGGGTYSCHFGPLTWVC-

GCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGCCCGC 

781 + + + + + + 840

CGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCG
KPQGGGGGGGGTYSCHFGPL-

BamHI
I 

TGACCTGGGTATGTAAGCCACAAGGGGGTTAATCTCGAGGATCC

841 + + + + 884 

ACTGGACCCATACATTCGGTGTTCCCCCAATTAGAGCTCCTAGG 
TWVCKPQGG*
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FIG. 1 7A 



[AatII sticky end]
(position #4358 in pAMG21) 



5' GCGTAACGTATGCATGGTCTCC
3' TGCACGCATTGCATACGTACCAGAGG



-CCATGCGAGAGTAGGGAACTGCCAGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACT 
•GGTACGCTCTCATCCCTTGACGGTCCGTAGTTTATTTTGCTTTCCGAGTCAGCTTTCTGA 

• GGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGC 
-CCCGGAAAGCAAAATAGACAACAAACAGCCACTTGCGAGAGGACTCATCCTGTTTAGGCG 

• CGGGAGCGGATTTGAACGTTGCGAAGCAACGGCCCGGAGGGTGGCGGGCAGGACGCCCGC 

- GCCCTCGCCTAAACTTGCAACGCTTCGTTGCCGGGCCTCCCACCGCCCGTCCTGCGGGCG

- CATAAACTGCCAGGCATCAAATTAAGCAGAAGGCCATCCTGACGGATGGCCTTTTTGCGT
-GTATTTGACGGTCCGTAGTTTAATTCGTCTTCCGGTAGGACTGCCTACCGGAAAAACGCA 

AatII

- TTCTACAAACTCTTTTGTTTATTTTTCTAAATACATTCAAATATGGACGTCGTACTTAAC 

- AAGATGTTTGAGAAAACAAATAAAAAGATTTATGTAAGTTTATACCTGCAGCATGAATTG

- TTTTAAAGTATGGGCAATCAATTGCTCCTGTTAAAATTGCTTTAGAAATACTTTGGCAGC

- AAAATTTCATACCCGTTAGTTAACGAGGACAATTTTAACGAAATCTTTATGAAACCGTCG 

- GGTTTGTTGTATTGAGTTTCATTTGCGCATTGGTTAAATGGAAAGTGACCGTGCGCTTAC 

- CCAAACAACATAACTCAAAGTAAACGCGTAACCAATTTACCTTTCACTGGCACGCGAATG 

- TACAGCCTAATATTTTTGAAATATCCCAAGAGCTTTTTCCTTCGCATGCCCACGCTAAAC -

- ATGTCGGATTATAAAAACTTTATAGGGTTCTCGAAAAAGGAAGCGTACGGGTGCGATTTG • 

- ATTCTTTTTCTCTTTTGGTTAAATCGTTGTTTGATTTATTATTTGCTATATTTATTTTT • 

- TAAGAAAAAGAGAAAACCAATTTAGCAACAAACTAAATAATAAACGATATAAATAAAAAG • 

- GATAATTATCAACTAGAGAAGGAACAATTAATGGTATGTTCATACACGCATGTAAAAATA -

- CTATTAATAGTTGATCTCTTCCTTGTTAATTACCATACAAGTATGTGCGTACATTTTTAT -

- AACTATCTATATAGTTGTCTTTCTCTGAATGTGCAAAACTAAGCATTCCGAAGCCATTAT -

- TTGATAGATATATCAACAGAAAGAGACTTACACGTTTTGATTCGTAAGGCTTCGGTAATA -

- TAGCAGTATGAATAGGGAAACTAAACCCAGTGATAAGACCTGATGATTTCGCTTCTTTAA -

- ATCGTCATACTTATCCCTTTGATTTGGGTCACTATTCTGGACTACTAAAGCGAAGAAATT -

- TTACATTTGGAGATTTTTTATTTACAGCATTGTTTTCAAATATATTCCAATTAATCGGTG -

- AATGTAAACCTCTAAAAAATAAATGTCGTAACAAAAGTTTATATAAGGTTAATTAGCCAC -

- AATGATTGGAGTTAGAATAATCTACTATAGGATCATATTTTATTAAATTAGCGTCATCAT - 

- TTACTAACCTCAATCTTATTAGATGATATCCTAGTATAAAATAATTTAATCGCAGTAGTA -

• AATATTGCCTCCATTTTTTAGGGTAATTATCCAGAATTGAAATATCAGATTTAACCATAG • 

- TTATAACGGAGGTAAAAAATCCCATTAATAGGTCTTAACTTTATAGTCTAAATTGGTATC -

- AATGAGGATAAATGATCGCGAGTAAATAATATTCACAATGTACCATTTTAGTCATATCAG -

- TTACTCCTATTTACTAGCGCTCATTTATTATAAGTGTTACATGGTAAAATCAGTATAGTC -

ATAAGCATTGATTAATATCATTATTGCTTCTACAGGCTTTAATTTTATTAATTATTCTGT- 
TATTCGTAACTAATTATAGTAATAACGAAGATGTCCGAAATTAAAATAATTAATAAGACA- 

AAGTGTCGTCGGCATTTATGTCTTTCATACCCATCTCTTTATCCTTACCTATTGTTTGTC" 
TTCACAGCAGCCGTAAATACAGAAAGTATGGGTAGAGAAATAGGAATGGATAACAAACAG- 

GCAAGTTTTGCGTGTTATATATCATTAAAACGGTAATAGATTGACATTTGATTCTAATAA- 
CGTTCAAAACGCACAATATATAGTAATTTTGCCATTATCTAACTGTAAACTAAGATTATT- 
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FIG. 17B 

- ATTGGATTTTTGTCACACTATTATATCGCTTGAAATACAATTGTTTAACATAAGTACCTG - 

- TAACCTAAAAACAGTGTGATAATATAGCGAACTTTATGTTAACAAATTGTATTCATGGAC • 

- TAGGATCGTACAGGTTTACGCAAGAAAATGGTTTGTTATAGTCGATTAATCGATTTGATT - 

- ATCCTAGCATGTCCAAATGCGTTCTTTTACCAAACAATATCAGCTAATTAGCTAAACTAA - 

- CTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGA - 

- GATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGCT - 



• GCTCACTAGTGTCGACCTGCAGGGTACCATGGAAGCTTACTCGAGGATCCGCGGAAAGAA - 

- CGAGTGATCACAGCTGGACGTCCCATGGTACCTTCGAATGAGCTCCTAGGCGCCTTTCTT - 

- GAAGAAGAAGAAGAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATA - 
- CTTCTTCTTCTTCTTTCGGGCTTTCCTTCGACTCAACCGACGACGGTGGCGACTCGTTAT - 

- ACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGG - 

- TGATCGTATTGGGGAACCCCGGAGATTTGCCCAGAACTCCCCAAAAAACGACTTTCCTCC - 

- AACCGCTCTTCACGCTCTTCACGC 3' [SacII sticky end] 
-TTGGCGAGAAGTGCGAGAAGTG 5' (position #5904 in pAMG21) 
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FIG.18A-1 
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FIG.18B-2 
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FIG. 19A 



NdeI

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 
GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 
MDKTHTCPPCPAPELLGGP -

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + +- + + + 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + + + + + + 180

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 



VTCVVVDVSHEDPEVKFNWY



GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTAC^ ^ 

181 CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGG^ 

VDGVEVHNAKTKPREEQYNS - 
ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTCAATGGCAAGGAG ^ 

241 TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCC^ 

TYRVVSVLTVLHQDWLNGKE - 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAA^ ^ 

301 ATGTTCACGTCCAGAGGTTC^ 

YKCKVSNKALPAPIEKTISK • 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATC & ^ q 

361 CGG^TCCCGTCGGGGCTCCTGGTCTCCACATGTGGGACGCK^ 

AKGQPREPQVYTLPPSRDEL - 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCC^ ^ 

421 TGGCTCTOGGTCCAGTCGGACTGGACGGACCA^ 

TKNQVSLTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTC ^ 

4 8 1 CACCTCACCCTCTCGWACCCGTCGGCCTCTTGTTGATGTTC 



VEWESNGQPENNYKTTPPVL 




GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTTCACAAGAGCAGGTG ^ 

1 ctgaggctgJcgaggaagaIggagatgtcgttcgagtggcacctc^ 

DSDGSFFLYSKLTVDKSRWQ - 
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FIG. 19B 

CAGGGGAACGTCTTCTCA^ 
601 qtCCCCTTGCAGMGAGTAM 

QGNVFSCSVMHEALHNHYTQ -

MGAGCCTCTCCCTG^^ 

661 ^tcggagagggacagaggcc^ 

KSLSLSPGKGGGGGDFLPHY -



BamHI 



AAAAACACCTCTCTGGGTCACCGTCCGTAATGGATCC

721 + + + 757

TTTTTGTGGAGAGACCCAGTGGCAGGCATTACCTAGG

KNTSLGHRP*
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FIG. 20A 



NdeI

CATATGGACTTCCTGCCGCACTACAAAAACACCTCTCTGGGTCACCGTCCGGGTGGAGGC 

1 + + + + + + 60

GTATACCTGAAGGACGGCGTGATGTTTTTGTGGAGAGACCCAGTGGCAGGCCCACCTCCG 

MDFLPHYKNTSLGHRPGGG 

GGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCG 

61 + + + + + + 120

CCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGC 

GGDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

121 + + + - + + + 180 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

181 + + + + 240 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

241 + + + + + + 300 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

301 + + + + + + 360 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

361 + + + + + 420 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

421 + + + + + + 480 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC

481 + + + + + + 540

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG

TKNQVSLTCLVKGFYPSDIA

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

541 + + + + + + 600 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 

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FIG. 20B 

GACTCCGACGGCTCCTTCTTCCTCT^^^ 660 
601 ctgaggctkcgaggam^^ 

DSDGSFFLYSKLTVDKSRWQ • 
CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG

661 + + + + + + 720

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC
QGNVFSCSVMHEALHNHYTQ -



BamHI 
I 

AAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCCGCGG

721 + + + + 761

TTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGGCGCC
KSLSLSPGK* 
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FIG. 21A 



NdeI
I 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

1 + + + + + + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + + + + + 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + + + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + + + + + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + + 480

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG

TKNQVSLTCLVKGFYPSDIA

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + + + + + + 540

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC

VEWESNGQPENNYKTTPPVL

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 + + + + + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 
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FIG.21B 

CAGGGGMCGTCTTCTCATGCTCCGTGA^^ 

601 gtcCCCTTGCAGAAGAG^ACGAGGCACTACG^^ 

QGNVFSCSVMHEALHNHYTQ -



AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTTTCGWTG^CCCCGGGT ^ ^ 

661 TTCTCGGAGAGGGACAGA^CCCATTTCCACCTC^ 

KSLSLSPGKGGGGGFEWTPG -



BamHI 



TACTGGCAGCCGTACGCTCTGCCGCTGTAATGGATCCCTCGAG

721 + + + + 763

ATGACCGTCGGCATGCGAGACGGCGACATTACCTAGGGAGCTC

YWQPYALPL*
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FIG. 22A 



NdeI



CATATGTTCGAATGGACCCCGGGTTACTGGCAGCCGTACGCTCTGCCGCTGGGTGGAGGC 

1 + + + + + + 60 

GTATACAAGCTTACCTGGGGCCCAATGACCGTCGGCATGCGAGACGGCGACCCACCTCCG 

MFEWTPGYWQPYALPLGGG 

GGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCG 

61 + + + + + + 120 

CCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGC 

GGDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

121 + + + + + + 180 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

181 + + + + + + 240 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

241 + + + + + + 300

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

301 + + + + + + 360

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

361 + + + + + + 420

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT

YKCKVSNKALPAPIEKTISK

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

421 + + + + ---- + + 480 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

481 + + + + ---+--- + 540 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG

TKNQVSLTCLVKGFYPSDIA

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG

541 + + + + + + 600

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 
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FIG. 22B 



GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

601 + + + + + + 660 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ - 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

661 + + * + + + 720 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ - 

BamHI 
I 

AAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

721 ---- * + + 757 

TTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 

KSLSLSPGK* 
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FIG. 23A 



NdeI



CATATGGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCG 

1 + + + + + + 60

GTATACCTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + + + + + 120 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + + + + + + 180

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + + + + + 240

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG

VDGVEVHNAKTKPREEQYNS

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + + + + + + 300

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC

TYRVVSVLTVLHQDWLNGKE

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + + + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG

361 + + + + + + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL -

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + + 480

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG

TKNQVSLTCLVKGFYPSDIA



GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG

481 + + + + + + 540

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG

541 + + + + + + 600

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC

DSDGSFFLYSKLTVDKSRWQ -
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FIG. 23B 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 + + + + + + 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGTGGTGGTGGTGTTGAACCGAACTGTGAC 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCACCACCACCACAACTTGGCTTGACACTG 

KSLSLSPGKGGGGGVEPNCD 

BamHI 

I 

ATCCATGTTATGTGGGAATGGGAATGTTTTGAACGTCTGTAACTCGAGGATCC 

721 + + + + + 773 

TAGGTACAATACACCCTTACCCTTACAAAACTTGCAGACATTGAGCTCCTAGG 

IHVMWEWECFERL*
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FIG. 24A 

NdeI

CATATGGTTGAACCGAACTGTGACATCCATGTTATGTGGGAATGGGAATGTTTTGAACGT

1 + + + + + + 60

GTATACCAACTTGGCTTGACACTGTAGGTACAATACACCCTTACCCTTACAAAACTTGCA 

MVEPNCDIHVMWEWECFER

CTGGGTGGTGGTGGTGGTGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTC 

61 + + + + + 120 

GACCCACCACCACCACCACTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAG 

LGGGGGDKTHTCPPCPAPEL 

CTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCC 

121 + + + + + + 180 

GACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGG 

LGGPSVFLFPPKPKDTLMIS 

CGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAG 

181 + + + + + + 240 

GCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTC 

RTPEVTCVVVDVSHEDPEVK 

TTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAG 

241 + + + + + + 300 

AAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTC 

FNWYVDGVEVHNAKTKPREE 

CAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTG 

301 + + + + + + 360

GTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGAC

QYNSTYRVVSVLTVLHQDWL

AATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAA 

361 + + + + + + 420

TTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTT

NGKEYKCKVSNKALPAPIEK

ACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCC 

421 + + + + + + 480 

TGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGG 

TISKAKGQPREPQVYTLPPS 

CGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCC

481 + + + + + + 540

GCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGG 



RDELTKNQVSLTCLVKGFYP 

AGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACG 

541 + + + + + + 600

TCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGC 

SDIAVEWESNGQPENNYKTT 
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FIG. 24B 

CCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAG 

601 + + + + + + 660 

GGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTC 

PPVLDSDGSFFLYSKLTVDK

AGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAAC 

661 + + + + + + 720 

TCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTG 

SRWQQGNVFSCSVMHEALHN - 

BamHI 
I 

CACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAACTCGAGGATCC 

721 + + + + ---. + .-. 773 

GTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTGAGCTCCTAGG 

HYTQKSLSLSPGK* 
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FIG. 25A 

NdeI



I 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + + + + + 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + + + + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + + + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + + + +• + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + + + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 + + + + + + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + + 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + + + + + + 540

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 + + + + + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 



DSDGSFFLYSKLTVDKSRWQ 
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FIG. 25B 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG

601 + + + + + + 660

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC

QGNVFSCSVMHEALHNHYTQ - 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTTGCACCACCCACTGGGGT 

661 + + + + + + 720

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCAACGTGGTGGGTGACCCCA

KSLSLSPGKGGGGGCTTHWG -
BamHI 

TTCACCCTGTGCTAATGGATCCCTCGAG 

721 + + 748 

AAGTGGGACACGATTACCTAGGGAGCTC 

F T L C * 
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FIG. 26A 

NdeI



I 

CATATGTGCACCACCCACTGGGGTTTCACCCTGTGCGGTGGAGGCGGTGGGGACAAAGGT 

1 + + + + + + 60 

GTATACACGTGGTGGGTGACCCCAAAGTGGGACACGCCACCTCCGCCACCCCTGTTTCCA 

MCTTHWGFTLCGGGGGDKG 

GGAGGCGGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGG 

61 + + + + + + 120 

CCTCCGCCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCC 

GGGGDKTHTCPPCPAPELLG

GGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACC 

121 + + + + + + 180 

CCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGG 

GPSVFLFPPKPKDTLMISRT 

CCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAAC 

181 + + + + + + 240 

GGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTG

PEVTCVVVDVSHEDPEVKFN

TGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTAC 

241 + + + + + + 300 

ACCATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATG 

WYVDGVEVHNAKTKPREEQY 

AACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGC 

301 + + + + -* + + 360 

TTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCG 

NSTYRVVSVLTVLHQDWLNG 

AAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATC 

TTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAG 

KEYKCKVSNK ALPAPIEKTI 

TCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGAT 

421 + + + + + + 480 

AGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTA 

SKAKGQPREPQVYTLPPSRD 

GAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGAC 

481 ♦ + + + + -~ + ~ 540 

CTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTG 

ELTKNQVSLTCLVKGFYPSD 

ATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCC 

541 + + + + + + 600 

TAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGG 



IAVEWESNGQPENNYKTTPP 
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FIG. 26B 



GTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGG 

601 + + + + + + 660 

CACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCC 

VLDSDGSFFLYSKLTVDKSR

TGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTAC 

661 + + ♦ + * + 720 

ACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATG 

WQQGNVFSCSVMHEALHNHY 

BamHI 
I 

ACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

721 + + + + 763

TGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 

TQKSLSLSPGK* 
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